AIMonk Labs
Popular repositories
- awesome-segmentation-saliency-dataset Public Forked from FnSK4R17s/awesome-segmentation-saliency-dataset
A collection of some datasets for segmentation / saliency detection. Welcome to PR...:smile:
HTML
- DALI Public Forked from NVIDIA/DALI
A GPU-accelerated library containing highly optimized building blocks and an execution engine for data processing to accelerate deep learning training and inference applications.
C++
- end2end_open_release Public
An end-to-end model for detection and recognition of images
Python
- vall-e Public Forked from lifeiteng/vall-e
PyTorch implementation of VALL-E (Zero-Shot Text-To-Speech). Reproduced demo: https://lifeiteng.github.io/valle/index.html
Python
- FastSpeech2 Public Forked from ming024/FastSpeech2
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Python
Repositories
- MuseTalk Public Forked from TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting
- fairseq Public Forked from facebookresearch/fairseq
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
- SyncTalk Public Forked from ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
- tortoise-tts Public Forked from neonbjb/tortoise-tts
A multi-voice TTS system trained with an emphasis on quality
- IP_LAP Public Forked from Weizhi-Zhong/IP_LAP
CVPR2023 talking face implementation for Identity-Preserving Talking Face Generation With Landmark and Appearance Priors
- aeneas Public Forked from readbeyond/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
- ICCV2023-MCNET Public archive Forked from harlanhong/ICCV2023-MCNET
The official code of our ICCV2023 work: Implicit Identity Representation Conditioned Memory Compensation Network for Talking Head video Generation
People
This organization has no public members. You must be a member to see who’s a part of this organization.