Skip to content

Support different audio / directory structures #18

@evamaxfield

Description

@evamaxfield

Feature Description

Support a more direct directory structure of speakers instead of "conversations". i.e.:

data/
├── bob/
|   ├── 0.wav
|   ├── 1.wav
|   ├── 2.wav
|   ├── 3.wav
|   ├── 4.wav
├── sally/
|   ├── 5.wav
|   ├── 6.wav
|   ├── 7.wav
|   ├── 8.wav
|   ├── 9.wav
└── eva/
    ├── 10.wav
    ├── 11.wav
    ├── 12.wav
    ├── 13.wav
    ├── 14.wav

Where all the audio for each speaker is provided as a directory. This would involve creating new functions for preparing the dataset -- with no guarantee that the "conversation id" holdout condition is met.

Use Case

See #17 -- direct use case already done.

Solution

Alternatives

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions