BasicVSR++: reproduce NTIRE decompression results on track 3 #1216
-
Hi, after testing, the Eval-PSNR is 30.0519 and the Eval-lq_PSNR is 28.3367, so I only gain a 1.71 dB improvement on track 3. When testing, I set num_input_frames to the length of each sequence so that the full video sequence is used as input. Can you give some advice?
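For reference, a minimal sketch of what that test-time setting might look like in an MMEditing-style config; the dataset type, folder paths, and exact keys are placeholders to check against your own config:

```python
# Hypothetical test-config fragment that feeds whole sequences to the
# recurrent model. Names below are placeholders, not the official config.
test_pipeline = []  # placeholder: your usual test-time transforms

data = dict(
    test=dict(
        type='SRFolderMultipleGTDataset',
        lq_folder='data/track3/lq',
        gt_folder='data/track3/gt',
        pipeline=test_pipeline,
        scale=1,  # track 3 is quality enhancement, so no upscaling
        num_input_frames=None,  # assumed: None -> use every frame
    ))
```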
-
We also used ensemble to further boost the performance. I am going to implement the ensemble in the following days (or weeks).
-
Out of curiosity, can someone explain to me what "ensemble" means in this context?
-
Here ensemble means flipping and rotating the images spatially. After rotating and flipping, you should have 8 copies of the original sequence. Then we do inference 8 times and take the average of the outputs.
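A minimal sketch of this x8 self-ensemble in PyTorch; the model interface and the (n, t, c, h, w) tensor layout are assumptions, so the official implementation may differ:

```python
import torch

def spatial_ensemble(model, lq):
    """x8 self-ensemble: average the outputs over the 8 spatial symmetries
    (4 rotations x with/without horizontal flip). The average is a plain
    per-pixel mean. A fully convolutional model is assumed, since 90/270
    degree rotations swap the height and width of non-square frames.
    """
    outputs = []
    for rot in range(4):            # 0/90/180/270 degree rotations
        for flip in (False, True):  # unmirrored and mirrored copies
            x = lq.flip(-1) if flip else lq
            x = torch.rot90(x, rot, dims=(-2, -1))
            with torch.no_grad():
                y = model(x)
            # Undo the transform on the output before averaging.
            y = torch.rot90(y, -rot, dims=(-2, -1))
            if flip:
                y = y.flip(-1)
            outputs.append(y)
    return torch.stack(outputs).mean(dim=0)
```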
-
Ah interesting, so four 90-degree rotated copies, each unmirrored and mirrored. Is the average a simple per-pixel calculation or something more complex? I'm currently working on a heavily compressed low-res clip, and my current steps for a quite good result go as:

Anyway, the ensemble route looks interesting. Looking at the compute time for my 1080 Ti as is... oh, it's going to cry ^^"
-
That is quite a lot of steps. I think there could be some better ways to go, but that would require more exploration.
-
Yup, but the ensemble way doesn't sound like fewer steps ^^ Currently experimenting with an AI-based deblock prepass. It seemed to improve things even more, but also seemed to have "smoothed" some details out a bit. Something I need to explore more.

EDIT: And now I'm wondering whether it's worth training my own model on the Vimeo-90K dataset, but using Cinepak as the degradation process to produce the LR images oO. Though my assumption is that this is going to take ages on my 1080 Ti, if the memory is even enough.
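A rough sketch of such a degradation round-trip, assuming ffmpeg with its Cinepak encoder is on PATH; the paths and frame patterns are just examples:

```python
import os
import subprocess

def cinepak_degrade(gt_pattern, lq_pattern, fps=24, tmp='degraded.avi'):
    """Round-trip clean frames through Cinepak to create degraded copies."""
    os.makedirs(os.path.dirname(lq_pattern), exist_ok=True)
    # Encode the clean frames with the Cinepak codec into an AVI...
    subprocess.run(['ffmpeg', '-y', '-framerate', str(fps), '-i', gt_pattern,
                    '-c:v', 'cinepak', tmp], check=True)
    # ...then decode back to individual frames as the low-quality inputs.
    subprocess.run(['ffmpeg', '-y', '-i', tmp, lq_pattern], check=True)

cinepak_degrade('gt/%08d.png', 'lq/%08d.png')
```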
-
I think Vimeo-90K is not a very good dataset if you want to use recurrent networks, since it contains only 7 frames for each sequence.
-
Oh okay. Do you have any suggestions for what would be a better fit?
-
If you can construct the "low quality" videos yourself, you can consider using the REDS dataset. It contains 100 or 500 high-quality frames per sequence, depending on which version you use.
-
Yup, I can (and even need to) do it. Thanks :)
-
Thanks for your great work.
-
The model in MMEditing currently does not include the ensemble. The ensemble code is still in a PR; we can do a comparison afterwards. As for your second question, I am not quite sure what you mean.
-
I mean that, in real-world use, the model is less effective on compressed video than on low-resolution but otherwise clean videos. Trying different combinations of down-then-up scaling may help, but are there any blind ways to improve effectiveness?
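As a sketch of the down-then-up idea; the scale factor and interpolation choices here are illustrative, not tuned values:

```python
import cv2

def down_up(frame, factor=0.5):
    """Shrink a frame and scale it back, attenuating block artifacts
    at the cost of some fine detail."""
    h, w = frame.shape[:2]
    small = cv2.resize(frame, (int(w * factor), int(h * factor)),
                       interpolation=cv2.INTER_AREA)
    return cv2.resize(small, (w, h), interpolation=cv2.INTER_CUBIC)
```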
-
I assume that is due to how this model was pretrained. Training your own version on specific compression methods may give better results for specific cases.
-
Hi, I'm reproducing your paper. Could you release the ensemble test code?
-
Hello, you can refer to #585. The PR will be merged after review.
-
Thanks a lot, that's a great help.