Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Recognise Elan media offset when reading annotation start/end times #193

Open
benfoley opened this issue Mar 18, 2021 · 3 comments · May be fixed by #215
Open

Recognise Elan media offset when reading annotation start/end times #193

benfoley opened this issue Mar 18, 2021 · 3 comments · May be fixed by #215
Assignees
Labels
data-prep UI Involves working on the elpis/gui front end JS codebase

Comments

@benfoley
Copy link
Contributor

Elpis currently expects annotations to correlate to beginning of media. However, ELAN files can have offset media start times which would result in mis-aligned training.

When reading the annotation data from ELAN files, we should look in the header for media offset information and add/subtract that to the annotation times so that the alignment is correct.

Or trim the media sox input output trim 5000?

@benfoley benfoley added UI Involves working on the elpis/gui front end JS codebase data-prep labels Mar 18, 2021
@benfoley
Copy link
Contributor Author

@benfoley make an example ELAN file to demonstrate.

@mattchrlw mattchrlw self-assigned this Mar 22, 2021
@benfoley
Copy link
Contributor Author

The offset is the TIME_ORIGIN value.

        <MEDIA_DESCRIPTOR
            MEDIA_URL="file:///Users/bbb/Desktop/abui/abui-audio-1.wav"
            MIME_TYPE="audio/x-wav"
            RELATIVE_MEDIA_URL="./abui-audio-1.wav" TIME_ORIGIN="1000"/>

@benfoley
Copy link
Contributor Author

benfoley commented Mar 22, 2021

Here's some sample data, adjusted the Abui toy corpus with various offset values.

abui-offset.zip

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
data-prep UI Involves working on the elpis/gui front end JS codebase
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants