Add expansion_type parameter to API call from eland_import_hub_model by daixque · Pull Request #802 · elastic/eland

daixque · 2025-07-22T09:00:09Z

Overview

This PR is related to the Elasticsearch change: elastic/elasticsearch#131679

In the above PR, Elasticsearch supports sparse embeddings including non-ELSER models. A significant example of a sparse vector model is the SPLADE model, which is a reference model for ELSER.
To inform Elasticsearch that the target model is for a SPLADE type one, this PR introduces a new expansion_type parameter when it calls the create trained model API.

How Eland detects the SPLADE model

Eland identifies the model as a SPLADE model by checking the dimention of the output tensor. If the second dimension of the output tensor is over 1, it is considered a SPLADE model. This is because SPLADE models typically output embeddings per token, which is different from ELSER.

In the current codebase, text_expansion is used in anywhere. (No sparse_embedding)
https://github.com/search?q=repo%3Aelastic%2Feland%20text_expansion&type=code

% grep -inR "text_expansion" eland tests | grep -v "Binary file" eland/ml/pytorch/transformers.py:76: "text_expansion", eland/ml/pytorch/transformers.py:97: "text_expansion": TextExpansionInferenceOptions, eland/ml/pytorch/transformers.py:557: elif self._task_type == "text_expansion": eland/ml/pytorch/transformers.py:747: if self._task_type == "text_expansion": eland/ml/pytorch/nlp_ml_model.py:320: super().__init__(configuration_type="text_expansion") tests/ml/pytorch/test_pytorch_model_config_pytest.py:149: "text_expansion", tests/ml/pytorch/test_pytorch_model_config_pytest.py:217: if task_type == "text_expansion":

Should I rename everything? It will cause CLI interface change. Should we keep --task-type=text_expansion for the compatibility? (I feel that renaming should be another PR)

Ah ok thanks. Yes the rename is not necessary in this PR

Got it, thanks

eland/ml/pytorch/transformers.py

Co-authored-by: David Kyle <david.kyle@elastic.co>

Add expansion_type parameter to TextExpansionInferenceOptions and upd…

317287b

…ate inference logic for text expansion task

daixque requested a review from davidkyle July 22, 2025 09:00

daixque mentioned this pull request Jul 22, 2025

[ML] SPLADE embedding support elastic/elasticsearch#131679

Open

Add test for splade config

b80b67c

davidkyle reviewed Jul 22, 2025

View reviewed changes

daixque and others added 2 commits July 23, 2025 23:00

Add restriction to set the expansion_type parameter

e86cd0f

Co-authored-by: David Kyle <david.kyle@elastic.co>

Merge branch 'main' into add_expansion_type_param

a5ffba2

+                      elif self._task_type == "text_expansion":
+                          sample_embedding = self._traceable_model.sample_output()
+                          if type(sample_embedding) is tuple:
+                              text_embedding = sample_embedding[0]

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add expansion_type parameter to API call from eland_import_hub_model#802

Add expansion_type parameter to API call from eland_import_hub_model#802
daixque wants to merge 4 commits intoelastic:mainfrom
daixque:add_expansion_type_param

daixque commented Jul 22, 2025 •

edited

Loading

Uh oh!

davidkyle Jul 22, 2025

Uh oh!

daixque Jul 23, 2025 •

edited

Loading

Uh oh!

davidkyle Jul 23, 2025

Uh oh!

daixque Jul 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

daixque commented Jul 22, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Overview

How Eland detects the SPLADE model

Related

Uh oh!

davidkyle Jul 22, 2025

Choose a reason for hiding this comment

Uh oh!

daixque Jul 23, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

davidkyle Jul 23, 2025

Choose a reason for hiding this comment

Uh oh!

daixque Jul 23, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

daixque commented Jul 22, 2025 •

edited

Loading

daixque Jul 23, 2025 •

edited

Loading