⭐ Star AlbumentationsX on GitHub — 294+ stars and counting!

Star on GitHub
TencentARC

ARC Lab, Tencent PCG

@TencentARCOrganization
Public repos
80
Public gists
0
Member since
May 6, 2021

On the leaderboard

RankRepositoryStars
670TencentARC/GFPGAN37,410

Top repositories by stars

  • TencentARC/GFPGAN(on leaderboard)

    GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.

    Python37,379
  • TencentARC/PhotoMaker

    PhotoMaker [CVPR 2024]

    Jupyter Notebook10,121
  • TencentARC/InstantMesh

    InstantMesh: Efficient 3D Mesh Generation from a Single Image with Sparse-view Large Reconstruction Models

    Python4,265
  • Python3,791
  • TencentARC/BrushNet

    [ECCV 2024] The official implementation of paper "BrushNet: A Plug-and-Play Image Inpainting Model with Decomposed Dual-Branch Diffusion"

    Python1,709
  • TencentARC/MotionCtrl

    Official Code for MotionCtrl [SIGGRAPH 2024]

    Python1,493
  • TencentARC/SEED-Voken

    SEED-Voken: A Series of Powerful Visual Tokenizers

    Python994
  • TencentARC/SEED-Story

    SEED-Story: Multimodal Long Story Generation with Large Language Model

    Python884
  • TencentARC/MasaCtrl

    [ICCV 2023] Consistent Image Synthesis and Editing

    Python837
  • TencentARC/BrushEdit

    [under review] The official implementation of paper "BrushEdit: All-In-One Image Inpainting and Editing"

    Python587
  • TencentARC/VideoPainter

    [SIGGRAPH2025] Official repo for paper "Any-length Video Inpainting and Editing with Plug-and-Play Context Control"

    Python569
  • TencentARC/ToonComposer

    [ICLR 2026] Streamlining Cartoon Production with Generative Post-Keyframing

    Python542
  • TencentARC/LLaMA-Pro

    [ACL 2024] Progressive LLaMA with Block Expansion.

    Python514
  • TencentARC/ColorFlow

    The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization". ColorFlow:基于检索增强的图像序列上色

    Python456
  • TencentARC/StereoCrafter

    A framework to convert any 2D videos to immersive stereoscopic 3D

    Python448
  • TencentARC/GeometryCrafter

    [ICCV 2025] GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors

    Python431
  • TencentARC/Mix-of-Show

    NeurIPS 2023, Mix-of-Show: Decentralized Low-Rank Adaptation for Multi-Concept Customization of Diffusion Models

    Python428
  • TencentARC/SmartEdit

    Official code of SmartEdit [CVPR-2024 Highlight]

    Python370
  • TencentARC/AnimeSR

    Codes for "AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos"

    Python364
  • TencentARC/VQFR

    ECCV 2022, Oral, VQFR: Blind Face Restoration with Vector-Quantized Dictionary and Parallel Decoder

    Python352
  • TencentARC/AnimeGamer

    [ICCV 2025] AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

    Python345
  • TencentARC/RollingForcing

    [ICLR 2026] Official Repo for Rolling Forcing: Autoregressive Long Video Diffusion in Real Time

    Python323
  • TencentARC/DiTCtrl

    [CVPR 2025] Official code of "DiTCtrl: Exploring Attention Control in Multi-Modal Diffusion Transformer for Tuning-Free Multi-Prompt Longer Video Generation"

    Python321
  • TencentARC/AudioStory

    AudioStory: Generating Long-Form Narrative Audio with Large Language Models

    Jupyter Notebook301
  • TencentARC/VerseCrafter

    VerseCrafter: Dynamic Realistic Video World Model with 4D Geometric Control

    Python300
  • TencentARC/TokLIP

    TokLIP: Marry Visual Tokens to CLIP for Multimodal Comprehension and Generation

    Python236
  • TencentARC/UMT

    UMT is a unified and flexible framework which can handle different input modality combinations, and output video moment retrieval and/or highlight detection results.

    Python234
  • TencentARC/ARC-Hunyuan-Video-7B

    Structured Video Comprehension of Real-World Shorts

    Python229
  • TencentARC/FreeSplatter

    [ICCV 2025] FreeSplatter: Pose-free Gaussian Splatting for Sparse-view 3D Reconstruction

    JavaScript222
  • TencentARC/ViT-Lens

    [CVPR 2024] ViT-Lens: Towards Omni-modal Representations

    Python190
  • TencentARC/MM-RealSR

    Codes for "Metric Learning based Interactive Modulation for Real-World Super-Resolution"

    Python175
  • TencentARC/Moto

    [ICCV2025 Oral] Latent Motion Token as the Bridging Language for Learning Robot Manipulation from Videos

    Python162
  • TencentARC/IC-Custom

    [Arxiv'25] IC-Custom: Diverse Image Customization via In-Context Learning

    Python160
  • TencentARC/ST-LLM

    [ECCV 2024🔥] Official implementation of the paper "ST-LLM: Large Language Models Are Effective Temporal Learners"

    Python150
  • TencentARC/GenCompositor

    [ICLR 2026] GenCompositor: Generative Video Compositing with Diffusion Transformer

    Python148
  • TencentARC/DeSRA

    Official codes for DeSRA (ICML 2023)

    Python141
  • TencentARC/MCQ

    Official code for "Bridging Video-text Retrieval with Multiple Choice Questions", CVPR 2022 (Oral).

    Python141
  • TencentARC/DI-PCG

    Code release of our paper "DI-PCG: Diffusion-based Efficient Inverse Procedural Content Generation for High-quality 3D Asset Creation".

    Python134
  • TencentARC/NVComposer

    [CVPR 2025] Boosting Generative Novel View Synthesis with Sparse and Unposed Images

    Python124
  • TencentARC/FAIG

    NeurIPS 2021, Spotlight, Finding Discriminative Filters for Specific Degradations in Blind Super-Resolution

    Python118
  • TencentARC/ArcNerf

    Nerf and extensions in all

    Jupyter Notebook107
  • TencentARC/TimeLens

    TimeLens: Rethinking Video Temporal Grounding with Multimodal LLMs

    Python102
  • TencentARC/mllm-npu

    mllm-npu: training multimodal large language models on Ascend NPUs

    Python95
  • TencentARC/Video-Holmes

    Video-Holmes: Can MLLM Think Like Holmes for Complex Video Reasoning?

    Python87
  • TencentARC/Divot

    Diffusion Powers Video Tokenizer for Comprehension and Generation (CVPR 2025)

    Python86
  • TencentARC/SurfelNeRF

    SurfelNeRF: Neural Surfel Radiance Fields for Online Photorealistic Reconstruction of Indoor Scenes

    78
  • TencentARC/MotionCrafter

    MotionCrafter: Dense Geometry and Motion Reconstruction with a 4D VAE

    Python76
  • TencentARC/RepSR

    Codes for "RepSR: Training Efficient VGG-style Super-Resolution Networks with Structural Re-Parameterization and Batch Normalization"

    75
  • TencentARC/HOSNeRF

    HOSNeRF: Dynamic Human-Object-Scene Neural Radiance Fields from a Single Video

    Python68
  • Jupyter Notebook65
  • TencentARC/FastRealVSR

    Codes for "Mitigating Artifacts in Real-World Video Super-Resolution Models"

    59
  • TencentARC/GVT

    Official code for "What Makes for Good Visual Tokenizers for Large Language Models?".

    Python58
  • TencentARC/ConMIM

    Official codes for ConMIM (ICLR 2023)

    Python58
  • TencentARC/TVTS

    Turning to Video for Transcript Sorting

    Jupyter Notebook49
  • TencentARC/BEBR

    Official code for "Binary embedding based retrieval at Tencent"

    Python44
  • TencentARC/ARC-Chapter

    Structuring Hour-Long Videos into Navigable Chapters and Hierarchical Summaries

    34
  • TencentARC/SGAT4PASS

    [IJCAI 2023] official implementation of the paper SGAT4PASS: Spherical Geometry-Aware Transformer for PAnoramic Semantic Segmentation

    Python34
  • TencentARC/pi-Tuning

    Official code for "pi-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation", ICML 2023.

    Python33
  • TencentARC/BTS

    BTS: A Bi-lingual Benchmark for Text Segmentation in the Wild

    33
  • TencentARC/FLM

    Accelerating Vision-Language Pretraining with Free Language Modeling (CVPR 2023)

    Python32
  • TencentARC/Efficient-VSR-Training

    Codes for "Accelerating the Training of Video Super-Resolution"

    30
  • TencentARC/DTN

    Official code for "Dynamic Token Normalization Improves Vision Transformer", ICLR 2022.

    Python29
  • TencentARC/BlobCtrl

    [SIGGRAPH ASIA'25] BlobCtrl: Taming Controllable Blob for Element-level Image Editing

    Python26
  • TencentARC/OpenCompatible

    OpenCompatible provides a standard compatible training benchmark, covering practical training scenarios.

    Python25
  • TencentARC/TaCA

    Official code for the paper, "TaCA: Upgrading Your Visual Foundation Model with Task-agnostic Compatible Adapter".

    16
  • TencentARC/common_trainer

    Common template for pytorch project. Easy to extent and modify for new project.

    Python13
  • TencentARC/TransFusion

    The code repo for the ACM MM paper: TransFusion: Multi-Modal Fusion for Video Tag Inference viaTranslation-based Knowledge Embedding.

    9
  • TencentARC/ArcVis

    Visualization of 3d and 2d components interactively.

    Jupyter Notebook6
  • TencentARC/vllm

    vllm for ARC-Hunyuan-Video-7B

    Python3