Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[BUG] 2.3.0-A with --psy-rd >0 produces artifacts and inconsistent results with identical source and settings. #117

Open
1 of 2 tasks
igorbaryshev opened this issue Dec 26, 2024 · 13 comments
Assignees
Labels
bug Something isn't working

Comments

@igorbaryshev
Copy link

igorbaryshev commented Dec 26, 2024

Overview

  • Random block-shaped artifacts at the exact same frames:
    mpv_uPNj9akV41
    mpv_NTlz8ectta
    mpv_D7bapSefgH
    mpv_KUGXi5Ftk7

  • A screenshot from another encode to make sure it's not just one video issue:
    image
     

  • File sizes are inconsistent as well, obviously.
    image

  • Presets 5, 6, 7 are the most affected. The slowest I tried was 3 and it wasn't as noticeable there. Preset 10 I couldn't really tell.

  • Starts to become visible at --psy-rd values around 0.5 and up, but file size inconsistencies are present with any non-0 value at any preset I tried.

  • CRF's tried were 35 and 22.5

Branch
In which branch does the issue appear to be occurring?

  • master
  • testing

Reproduction
Steps to reproduce the behavior:

  1. Encode with --psy-rd 6.0 and --preset 6
  2. Watch the result

Expected behavior
No obvious artifacts, consistent encoding results.

Platform (please complete the following information):

  • Windows 11 24H2
  • Clang 19.1.6, and another build by @Andarwinux with Clang 20.0.0git
  • Build command for my Clang 19.1.6 build:
cmake -B svt_build --fresh -T ClangCL -DBUILD_SHARED_LIBS=OFF -DENABLE_AVX512=OFF -DLIBDOVI_FOUND=1 -DLIBHDR10PLUS_RS_FOUND=1 -DCMAKE_CXX_FLAGS_RELEASE="/DNDEBUG /clang:-O2 -flto -march=znver1 -fprofile-use=D:\PGO\IR+CSIR.profdata" -DCMAKE_C_FLAGS_RELEASE="/DNDEBUG /clang:-O2 -flto -march=znver1 -fprofile-use=D:\PGO\IR+CSIR.profdata" && cmake --build svt_build --config Release

Version String (please complete the following information):

  • Clang 19.6.1 build
SVT-AV1-PSY v2.3.0-A-dirty (release)
PSY Release: A
  • Clang 20.0.0git build
SVT-AV1-PSY v2.3.0-A (release)
PSY Release: A

Additional context / Relevant Files

  • Was only able to test 10-bit encoding, because 8-bit encodes crash at the very beginning after random number of frames, like up to 400, sometimes as low as 44, average ~120.
  • I did several encodes with my Clang 19.1.6 build to verify that it's not just a build difference when it comes to resulting file size and artifact randomness.
  • With --psy-rd 0 files seem to be identical from encode to encode.
@igorbaryshev igorbaryshev added the bug Something isn't working label Dec 26, 2024
@BlueSwordM
Copy link
Collaborator

BlueSwordM commented Dec 27, 2024

Thank you for the bug report. I will try to reproduce it on my end (CachyOS Linux).

Did you try weaker settings (0.5 to 4.0)? Perhaps the calculated distortion is too high, because while I can't reproduce the delerious transform bug, I can see artifacts that never appear at more reasonable strengths (<2.0).

@igorbaryshev
Copy link
Author

igorbaryshev commented Dec 27, 2024

Did you try weaker settings (0.5 to 4.0)?

Yep, tried 3.0 and 4.0, same story. Just like I said, anything at 0.5 and up becomes pretty visible on my side.

@igorbaryshev
Copy link
Author

--psy-rd 1.0
mpv_WrGAL63L71

@kuzon3
Copy link

kuzon3 commented Dec 28, 2024

im getting the same bug even at 0.5.
built my own svt using the build instructions for windows.
Edit: Ok after some testing found out that if you use the default crf settings. --psy-rd works fine (did my testing with --psy-rd 3.0) but if you use a lower crf setting like crf 18. artifacts start appearing.
tests where made using svts defaults settings (i did switch to slow once i saw no artifacts where appearing at this setting. crf was still at 35(default))
Edit 2: some new info about psy-rd. i was coverting a movie from VC-1 to AV1 and decided to try psy rd and it worked! i was using default settings on slow with a crf of 14. the only thing different was VC-1 and i cropped out top and bottom black bars.
movie was "the fast and furious" bluray

@Ironclad17
Copy link

Ironclad17 commented Dec 29, 2024

I saw the same occasionally in very noisy, heavily filtered scenes with --psy-rd 1.0. Lowering to crf 10.0 and preset -2 appeared to resolve it, but it may have been random. This occurred in both 8-bit & 10-bit encodes of the same scene.
call ".\vcvars64.bat" && cmake --fresh -B svt_build -G Ninja -DCMAKE_BUILD_TYPE=Release -DBUILD_SHARED_LIBS=OFF -DBUILD_APPS=ON -DSVT_AV1_LTO=OFF -DLIBDOVI_FOUND=ON -DLIBHDR10PLUS_RS_FOUND=ON -DCMAKE_CXX_FLAGS_RELEASE="-Qunused-arguments -Wno-deprecated /DNDEBUG /clang:-O2 -flto -march=znver2 -fprofile-use=".\PGO\IR+CSIR.profdata"" -DCMAKE_C_FLAGS_RELEASE="-Qunused-arguments -Wno-deprecated /DNDEBUG /clang:-O2 -flto -march=znver2 -fprofile-use=".\PGO\IR+CSIR.profdata""

The scene in question was: --crf 15.0 --preset 4 --tune 3 --sharpness 2 --frame-luma-bias 100 --noise-norm-strength 4 --variance-boost-strength 4 --variance-octile 4 --psy-rd 1.0 --enable-dlf 2 --enable-cdef 0 --enable-restoration 0 --enable-tf 0 --fast-decode 1 --color-range 0 --matrix-coefficients 1 --transfer-characteristics 1 --color-primaries 1 --keyint 0 --lp 4
--crf 26.0 and --crf 44.5 had similar issues.

@gianni-rosato
Copy link
Collaborator

Hi,

Would you be able to provide a couple of things to assist in our diagnosis?

  • A sample clip
  • Encoding settings to reproduce the issues you describe
  • Test conditions that reproduce the bug on Linux and/or macOS

I'm able to reproduce the issue with inconsistent run-to-run behavior on Linux, but I cannot reproduce any visual issues.

We should probably make this more clear in our documentation, but it is worth noting that SVT-AV1-PSY is not officially supported for Windows and there are no plans to change this any time soon. We're most interested in reproducing the visual inconsistencies you present here in Linux/macOS environments. We greatly appreciate your help.

@igorbaryshev
Copy link
Author

  • Sample clip ->friends.zip
    *.y4m file was over the size limit packed in a 7z inside zip, so I'm providing the clip that I made the .y4m from with this command: ffmpeg -i friends.mkv -strict -1 friends.y4m
  • The simplest encoding settings successfully reproduce the issue on Ubuntu 24.10:
    SvtAv1EncApp --input friends.y4m --output friends.ivf --preset 6 --psy-rd 6.0
  • Test conditions were just basic Ubuntu 24.10 and the app itself, built with build.sh -x from testing branch with latest commit being fbe4e20 (just wanted to see whether it got fixed somehow in testing vs master)

@BlueSwordM
Copy link
Collaborator

  • Sample clip ->friends.zip
    *.y4m file was over the size limit packed in a 7z inside zip, so I'm providing the clip that I made the .y4m from with this command: ffmpeg -i friends.mkv -strict -1 friends.y4m

    • The simplest encoding settings successfully reproduce the issue on Ubuntu 24.10:
      SvtAv1EncApp --input friends.y4m --output friends.ivf --preset 6 --psy-rd 6.0

    • Test conditions were just basic Ubuntu 24.10 and the app itself, built with build.sh -x from testing branch with latest commit being fbe4e20 (just wanted to see whether it got fixed somehow in testing vs master)

Oh thank you, I am now able to reproduce the issue.
On my sources, even at P9, I wasn't able to reproduce this.

Thanks a lot ❤️

@BlueSwordM
Copy link
Collaborator

I was able to fix build issues, and have found new findings.

With this new patch in the testing branch:
a3f3ca1

A bug that a user messaged me personally was fixed by changing the absolute value conversion, which fixed the your bug at Preset 5 and Preset >8.

However, the issue still persists on Preset 6 and Preset 7.
I can't reproduce the issue with high --psy-rd values, but not low --psy-rd values.

In summary:

  1. Fixed some buggy code that was only visible in debug mode with the YASM assembler; NASM couldn't detect the undefined code.
  2. The issue with --psy-rdcan only be replicated with values >0.25.
  3. The issue with --psy-rd can only be replicated with Preset 6-7.

My best guess? Overflowing values with the RDO calculations for the rest of the encoded block artifacts, causing the encoder to believe they have so much distortion to blur them to pieces.

@igorbaryshev
Copy link
Author

@BlueSwordM, I'll get to checking it out in a bit, for sure!
Meanwhile, got another one for ya #122 😁

@igorbaryshev
Copy link
Author

igorbaryshev commented Jan 13, 2025

@BlueSwordM the issue is still present for me with latest testing commit.
This is on preset 3 with psy-rd 6:
image

@BlueSwordM
Copy link
Collaborator

I was able to fix build issues, and have found new findings.

With this new patch in the testing branch: a3f3ca1

A bug that a user messaged me personally was fixed by changing the absolute value conversion, which fixed the your bug at Preset 5 and Preset >8.

However, the issue still persists on Preset 6 and Preset 7. I can't reproduce the issue with high --psy-rd values, but not low --psy-rd values.

In summary:

1. Fixed some buggy code that was only visible in debug mode with the YASM assembler; NASM couldn't detect the undefined code.

2. The issue with `--psy-rd`can only be replicated with values >0.25.

3. The issue with `--psy-rd` can only be replicated with Preset 6-7.

My best guess? Overflowing values with the RDO calculations for the rest of the encoded block artifacts, causing the encoder to believe they have so much distortion to blur them to pieces.

Better news: my guess has become reality.
I found the issue: overflows were happening since the variable types we picked were... inadequate for some cases in LBD (8-bit) and very inadequate in HBD (10-bit).

It's even possible that we are losing some quality over this since it possibly means the wrong values are being calculated.
I'll try fix it Friday when I finally have time.

@A5362
Copy link

A5362 commented Jan 13, 2025

I also have this issue with --psy-rd4, I was using 2.3.0-A

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

6 participants