Does SMI-TED model support SMILES with stereochemical information?

Hi team,

I noticed that in the function normalize_smiles (at [load.py#L41](https://github.com/IBM/materials/blob/3b91abe7db153c8188bf7cf0e247cb4fba211c6e/models/smi_ted/smi_ted_light/load.py#L41) ), the parameter isomericis set to False by default:
```
def normalize_smiles(smi, canonical=True, isomeric=False):
    try:
        normalized = Chem.MolToSmiles(
            Chem.MolFromSmiles(smi),
            canonical=canonical,
            isomericSmiles=isomeric
        )
    except:
        normalized = None
    return normalized
```

As far as I understand, this means stereochemical information in SMILES (such as @, /, or \) will be lost during normalization.

I'd like to confirm:

1. Is the current SMI-TED model designed not to support SMILES with stereochemical information?

2. If that's the case, is there any plan or recommended way to handle isomeric SMILES (eg, enabling isomeric=Trueduring preprocessing)?

Thank you for your time and clarification!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Does SMI-TED model support SMILES with stereochemical information? #59

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Does SMI-TED model support SMILES with stereochemical information? #59

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions