CMER-SML #207

@YAMing11

Title: Question about SML labels in the released CMER training code

Hi, thanks for releasing the CMERNet implementation and the CMER/MER datasets.

I am trying to reproduce the training pipeline described in the paper. In the paper, CMERNet introduces Structured Mathematical Language (SML), where the raw LaTeX string is first parsed into a syntax tree and then serialized into a structured token sequence with grammar tokens.
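
For reference, this is how I currently understand the parse-then-serialize idea. It is only a minimal sketch of my own, covering a tiny LaTeX subset (`\frac`, `^`, `_`, braces). The grammar tokens (`<frac>`, `<sup>`, `<arg>`, ...) and the function names `tokenize`, `parse`, and `serialize` are placeholders I made up, not the SML vocabulary from the paper or anything in the released code.

```python
from __future__ import annotations

import re
from dataclasses import dataclass, field


@dataclass
class Node:
    label: str
    children: list[Node] = field(default_factory=list)


# Commands (\frac, \alpha, \,), structural characters, or any other single symbol.
TOKEN_RE = re.compile(r"\\(?:[A-Za-z]+|.)|[{}^_]|[^\s\\{}^_]")


def tokenize(latex: str) -> list[str]:
    return TOKEN_RE.findall(latex)


def parse(tokens: list[str]) -> Node:
    """Recursive-descent parser for the toy subset; returns a 'seq' root node."""
    pos = 0

    def parse_primary() -> Node:
        nonlocal pos
        if pos >= len(tokens):
            raise ValueError("unexpected end of input")
        tok = tokens[pos]
        if tok == "{":
            pos += 1
            node = parse_seq()
            if pos >= len(tokens) or tokens[pos] != "}":
                raise ValueError("missing '}'")
            pos += 1
            return node
        if tok in ("}", "^", "_"):
            raise ValueError(f"unexpected token {tok!r}")
        pos += 1
        if tok == "\\frac":
            # \frac takes two arguments: numerator and denominator.
            return Node("frac", [parse_primary(), parse_primary()])
        return Node(tok)

    def parse_atom() -> Node:
        nonlocal pos
        node = parse_primary()
        # Attach any trailing superscripts / subscripts to the base.
        while pos < len(tokens) and tokens[pos] in ("^", "_"):
            kind = "sup" if tokens[pos] == "^" else "sub"
            pos += 1
            node = Node(kind, [node, parse_primary()])
        return node

    def parse_seq() -> Node:
        items = []
        while pos < len(tokens) and tokens[pos] != "}":
            items.append(parse_atom())
        return Node("seq", items)

    tree = parse_seq()
    if pos != len(tokens):
        raise ValueError("unbalanced '}'")
    return tree


def serialize(node: Node) -> list[str]:
    """Flatten the tree into a token sequence with placeholder grammar tokens."""
    if node.label == "seq":
        return [tok for child in node.children for tok in serialize(child)]
    if not node.children:
        return [node.label]
    out = [f"<{node.label}>"]
    for child in node.children:
        out += ["<arg>", *serialize(child), "</arg>"]
    out.append(f"</{node.label}>")
    return out


print(" ".join(serialize(parse(tokenize(r"\frac{a}{b}+x^{2}")))))
```

With this toy scheme the structure of the expression is carried by explicit open/close tokens instead of raw braces, which is the property I expected to see in the training labels.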

However, when I checked the released OpenOCR CMER configuration and README, the training label format seems to be:
