Clipping subsampler refactor #275

MattUnderscoreZhang · 2024-01-19T02:37:27Z

I need to add ffmpeg pipes to speed up processing, and to do that I first needed to refactor some of the code to make it easier to reason about.

For context, I'm processing webvid-10M and the codebase is currently too slow. I'm hoping to speed it up by an order of magnitude by combining subsamplers into a single ffmpeg pipe operation, and adding other optimizations.

…vals

…tion timeouts

iejMac · 2024-01-19T19:03:45Z

nice, yes this is a great idea

iejMac · 2024-01-19T19:04:39Z

is this ready to go? If so could you give a high level overview of the changes?

MattUnderscoreZhang · 2024-01-19T19:50:17Z

This is basically just a refactor. I broke the code into smaller functions, added types, only process metadata once per clip (instead of once per stream), renamed some stuff for clarity, and cleaned up the segment_times collection code.

I avoided making any functional changes, but I think I see a couple of places for improvement that I'll hit in later pull requests. The ffmpeg pipe stuff is also going to come later, but I may need to refactor the other subsamplers first.

rom1504 · 2024-01-21T19:32:06Z

looks much better

I think the smaller functions could be an opportunity to add some more tests. What do you think?

rom1504 · 2024-01-21T19:51:28Z

just merged #262 and having a bit of trouble to rebase here hmm

rom1504 · 2024-01-21T20:04:08Z

probably best if you handle this rebase @MattUnderscoreZhang ; I think it's mostly about replacing the code that's doing the extract subtitle by that new function

…efactor

MattUnderscoreZhang · 2024-01-22T03:37:31Z

Yeah no prob, I just merged changes.

rom1504 · 2024-01-22T21:57:49Z

@MattUnderscoreZhang did you test this to produce the same results as before? this is a lot of complex code that is changed

MattUnderscoreZhang · 2024-01-24T00:48:59Z

Yes, I get the exact same results as before for my use case, which involves a frame rate resampling, resizing, cropping, and clipping.

Plus, all the unit tests pass as expected.

MattUnderscoreZhang added 10 commits January 18, 2024 10:33

ClippingSubsampler rewrite and bug fixes

2efa849

More refactoring of ClippingSubsampler, plus a fix to _get_clip_inter…

a5c9649

…vals

Finished refactoring ClippingSubsampler

2cb5854

Merge branch 'clipping_subsampler_rewrite' into all_fixes

6106f62

Final code changes

5d03b72

Added docstrings

47c7d64

Passed tests and linting

5aa84d4

Made type annotations consistent with Python 3.8

140e1ab

More annotation fixes

077ca27

The Python 3.8 annotation needs a lot of hand-holding, it seems

32fa4ea

MattUnderscoreZhang force-pushed the clipping_subsampler_refactor branch from b202296 to 32fa4ea Compare January 19, 2024 03:17

MattUnderscoreZhang added 2 commits January 19, 2024 00:00

Pylint has to cut it out, I swear to God

5a8957f

No real change, just relauching unit tests which failed due to connec…

f0f0168

…tion timeouts

Merge branch 'main' into clipping_subsampler_refactor

f5d7c85

Merge branch 'main' into clipping_subsampler_refactor

388f51a

Merge remote-tracking branch 'origin/main' into clipping_subsampler_r…

5101379

…efactor

MattUnderscoreZhang added 2 commits January 21, 2024 22:46

Linting issue

1df88dd

Another linting issue

226fba3

MattUnderscoreZhang mentioned this pull request Jan 24, 2024

Subset worker refactor #287

Merged

rom1504 merged commit e7a4591 into iejMac:main Jan 24, 2024
2 checks passed

pabl0 mentioned this pull request Jan 31, 2024

list index out of range #303

Open

pabl0 mentioned this pull request Mar 7, 2024

YouTube metadata is not saved #319

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clipping subsampler refactor #275

Clipping subsampler refactor #275

MattUnderscoreZhang commented Jan 19, 2024 •

edited

Loading

iejMac commented Jan 19, 2024

iejMac commented Jan 19, 2024

MattUnderscoreZhang commented Jan 19, 2024 •

edited

Loading

rom1504 commented Jan 21, 2024

rom1504 commented Jan 21, 2024

rom1504 commented Jan 21, 2024

MattUnderscoreZhang commented Jan 22, 2024

rom1504 commented Jan 22, 2024

MattUnderscoreZhang commented Jan 24, 2024 •

edited

Loading

Clipping subsampler refactor #275

Clipping subsampler refactor #275

Conversation

MattUnderscoreZhang commented Jan 19, 2024 • edited Loading

iejMac commented Jan 19, 2024

iejMac commented Jan 19, 2024

MattUnderscoreZhang commented Jan 19, 2024 • edited Loading

rom1504 commented Jan 21, 2024

rom1504 commented Jan 21, 2024

rom1504 commented Jan 21, 2024

MattUnderscoreZhang commented Jan 22, 2024

rom1504 commented Jan 22, 2024

MattUnderscoreZhang commented Jan 24, 2024 • edited Loading

MattUnderscoreZhang commented Jan 19, 2024 •

edited

Loading

MattUnderscoreZhang commented Jan 19, 2024 •

edited

Loading

MattUnderscoreZhang commented Jan 24, 2024 •

edited

Loading