
Speech-to-Text Model Comparison #34

@Cgarg9

Description:

To help users understand different speech recognition methods, add a notebook that runs several models on the same dataset and compares their results.
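One possible shape for the comparison loop, as a minimal sketch: every model is wrapped in a plain function that takes an audio path and returns a transcript, so all models see exactly the same clips. The names `run_models`, `fake_model_a`, `fake_model_b`, and the file `clip_001.wav` are hypothetical placeholders, not part of any of the libraries listed in the tasks; the real wrappers would call each model's own API.

```python
from typing import Callable, Dict, List, Tuple

# A transcriber is any function mapping an audio file path to a transcript.
Transcriber = Callable[[str], str]


def run_models(
    transcribers: Dict[str, Transcriber],
    dataset: List[Tuple[str, str]],
) -> Dict[str, List[str]]:
    """Run every transcriber on every (audio_path, reference) pair.

    Returns a dict mapping model name to its transcripts, in dataset order.
    """
    results: Dict[str, List[str]] = {name: [] for name in transcribers}
    for audio_path, _reference in dataset:
        for name, transcribe in transcribers.items():
            results[name].append(transcribe(audio_path))
    return results


# Hypothetical stand-ins for real model wrappers (assumption: illustration only).
def fake_model_a(audio_path: str) -> str:
    return "hello world"


def fake_model_b(audio_path: str) -> str:
    return "hello word"


# Hypothetical one-clip dataset of (audio path, reference transcript) pairs.
dataset = [("clip_001.wav", "hello world")]
outputs = run_models({"model_a": fake_model_a, "model_b": fake_model_b}, dataset)
```

Keeping the model wrappers behind one function signature means a model can be added or dropped without touching the evaluation code.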

Tasks:

  • Compare CMU Sphinx, DeepSpeech, Wav2Vec 2.0, and OpenAI Whisper.
  • Provide Word Error Rate (WER) and Sentence Error Rate (SER) comparisons.
  • Summarize key use cases and limitations for each model.
  • Name the notebook speech_to_text_comparison.ipynb.
  • Update the README file with relevant references.
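
For the WER and SER task, a minimal dependency-free sketch of both metrics might look like the following. It assumes the common definitions: WER is the word-level edit distance (substitutions, insertions, deletions) divided by the number of reference words, summed over the corpus, and SER is the fraction of sentences with at least one error. Lowercasing and whitespace tokenization are simplifying assumptions; the function names are illustrative, and the notebook could just as well use an existing metrics library.

```python
from typing import List, Sequence


def word_edit_distance(ref_words: Sequence[str], hyp_words: Sequence[str]) -> int:
    """Levenshtein distance between two word sequences."""
    prev = list(range(len(hyp_words) + 1))
    for i, ref_word in enumerate(ref_words, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp_words, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # deletion of a reference word
                curr[j - 1] + 1,     # insertion of a hypothesis word
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1]


def normalize(text: str) -> List[str]:
    # Assumption: lowercase and split on whitespace; punctuation is kept as-is.
    return text.lower().split()


def wer(references: List[str], hypotheses: List[str]) -> float:
    """Corpus-level word error rate: total edits / total reference words."""
    errors = 0
    words = 0
    for ref, hyp in zip(references, hypotheses):
        ref_words = normalize(ref)
        errors += word_edit_distance(ref_words, normalize(hyp))
        words += len(ref_words)
    return errors / words


def ser(references: List[str], hypotheses: List[str]) -> float:
    """Sentence error rate: fraction of sentences that are not an exact match."""
    wrong = sum(
        normalize(ref) != normalize(hyp)
        for ref, hyp in zip(references, hypotheses)
    )
    return wrong / len(references)


references = ["the cat sat on the mat", "hello world"]
hypotheses = ["the cat sat on a mat", "hello world"]
print(wer(references, hypotheses))  # 0.125 (1 substitution over 8 reference words)
print(ser(references, hypotheses))  # 0.5 (1 of 2 sentences contains an error)
```

Applying the same normalization to every model's output matters here, since differences in casing or punctuation would otherwise show up as recognition errors.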

Metadata

Labels: medium (medium level difficulty), pwoc
