-
Notifications
You must be signed in to change notification settings - Fork 163
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Added video processing section (Unit 7 - Multimodal Based Video Model) #355
Added video processing section (Unit 7 - Multimodal Based Video Model) #355
Conversation
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
Co-authored-by: Jiwook Han <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the contribution and sorry for the late review.
It was a really nice read and I feel like I've got a better idea about multimodal video models now ;)
I mainly left formatting suggestions (mostly repetitive) and make sure to add the pictures :)
@@ -0,0 +1,120 @@ | |||
# Multimodal Based Video Models[[mutilmodal-based-video-models]] |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It is really nice that you thought about adding anchors, but actually this is not really needed, unless you want to refer back to the chapter within your own file.
In general the hf-doc-builder will create anchors for every headline automatically, so you can actually remove them :)
5. Depth Modality: Represents the 3D spatial information of the video. | ||
6. Sensor Modality: In some applications, videos may include modalities like temperature or biometric data. | ||
|
||
/* Modality Overview Image */ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Is there supposed to be an actual image here?
|
||
- **Overview** | ||
|
||
/* VideoBERT Overview Image */ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Again the image question ;)
|
||
- **Overview** | ||
|
||
/* VATT Overview Image */ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Image? (don't want to annoy you, just make sure you don't miss these :) )
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
|
||
- **Overview** | ||
|
||
/* ImageBind Overview Image */ |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Image ;)
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
chapters/en/unit7/video-processing/multimodal-based-video-models.mdx
Outdated
Show resolved
Hide resolved
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Hello, @johko ! First of all, sorry for the late reply. I really appreciate your detailed review! I committed most of your suggestions, removed the anchor and added images.
…ls.mdx Co-authored-by: Johannes Kolbe <[email protected]>
…ls.mdx Co-authored-by: Johannes Kolbe <[email protected]>
Co-authored-by: Johannes Kolbe <[email protected]>
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for the changes 🙂
Now it looks good to me 👍
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great additions..LGTM!
Added
multimodal-based-video-models.mdx
at video processing section. This document provides an overview of various multimodal video architectures which integrate different kinds of modalities into a unified representation space.Part of Proposed Outline Revision for Unit 7. Video & Video Processing #348
Who can review? (Initial)
@jungnerd @cjfghk5697 @mreraser and anyone who wants to review!