⭐ Star AlbumentationsX on GitHub — 294+ stars and counting!

Star on GitHub
hacksider

Kenneth Estanislao

@hacksiderUser

I typically don’t respond to chats unless we’ve exchanged emails first—think of it as the digital handshake. And while I don’t work for free, I do work for some

Public repos
149
Followers
2,603
Following
9
Public gists
0
Member since
Dec 16, 2011

On the leaderboard

RankRepositoryStars
125hacksider/Deep-Live-Cam88,455

Top repositories by stars

  • hacksider/Deep-Live-Cam(on leaderboard)

    real time face swap and one-click video deepfake with only a single image

    Python79,536
  • hacksider/ShortsGenerator

    Automate the creation of Shorts content locally with a couple simple steps.

    Python42
  • hacksider/screenshot-to-code

    Drop in a screenshot and convert it to clean code (HTML/Tailwind/React/Vue)

    TypeScript26
  • hacksider/Short-Video-Creator

    Automatic | AI-generated captions - No API Key | Background Video | YouTube Shorts | TikTok

    Python24
  • hacksider/Webcam_Live_Portrait

    Bring portraits to life via webcam!

    Python23
  • hacksider/real-time-voice-translator

    A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

    Tcl19
  • hacksider/call-gpt

    Generative AI phone call toolkit using Twilio Media Streams.

    JavaScript19
  • hacksider/Deep-Translate-Engine

    a live subtitle with translation engine

    Python18
  • hacksider/sd-webui-roop-uncensored

    uncensored roop extension for StableDiffusion web-ui

    Python18
  • hacksider/aidialer

    A full stack app for interruptible, low-latency and near-human quality AI phone calls built from stitching LLMs, speech understanding tools, text-to-speech models, and Twilio’s phone API.

    Python15
  • hacksider/ShortGPT

    🚀🎬 ShortGPT - Experimental AI framework for automated short/video content creation.

    Python15
  • hacksider/Deep-Live-Mic

    Advanced RVC Inference for quicker and effortless model downloads

    Python14
  • hacksider/LLPlayer

    The media player for language learning, with dual subtitles, AI-generated subtitles, realtime-OCR, translation, word lookup, and more!

    C#14
  • hacksider/maxun

    Free, open-source no-code web data extraction platform. Build custom robots to automate data scraping [In Beta]

    TypeScript14
  • hacksider/suna

    Suna - Open Source Generalist AI Agent

    TypeScript13
  • hacksider/RealTime-Voice-Translation-using-Whisper

    The application allows users to record speech, transcribe it using the Whisper ASR (Automatic Speech Recognition) model, translate the transcribed text into a selected language, and play back the translated text using the Elevenlabs TTS (Text-to-Speech) engine.

    Python13
  • hacksider/kestra

    :zap: Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...

    Java11
  • hacksider/fantasy-talking

    FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

    Python10
  • hacksider/dia

    A TTS model capable of generating ultra-realistic dialogue in one pass.

    Python10
  • hacksider/face-censor

    Detect and blur faces in any input images or videos with AI.

    Python10
  • hacksider/movie-web

    A small web app for watching movies and shows easily

    TypeScript10
  • hacksider/Wan2.2

    Wan: Open and Advanced Large-Scale Video Generative Models

    Python9
  • hacksider/hallo-for-windows

    Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation

    Python9
  • hacksider/Face_Animation_Real_Time

    One-shot face animation using webcam, capable of running in real time.

    Python9
  • hacksider/DeepTutor

    "DeepTutor: AI-Powered Personalized Learning Assistant"

    Python7
  • hacksider/PersonaLive

    PersonaLive! : Expressive Portrait Image Animation for Live Streaming

    Python7
  • hacksider/ai-data-analysis-MulitAgent

    AI-Driven Research Assistant: An advanced multi-agent system for automating complex research processes. Leveraging LangChain, OpenAI GPT, and LangGraph, this tool streamlines hypothesis generation, data analysis, visualization, and report writing. Perfect for researchers and data scientists seeking to enhance their workflow and productivity.

    Python7
  • hacksider/Integuru

    The first AI agent that builds third-party integrations through reverse engineering platforms' internal APIs.

    Python7
  • hacksider/xtts2-ui

    A User Interface for XTTS-2 Text-Based Voice Cloning

    Python7
  • hacksider/superagent

    🥷 The open framework for building AI Assistants

    JavaScript7
  • hacksider/abogen

    Generate audiobooks from EPUBs, PDFs and text with synchronized captions.

    Python6
  • hacksider/voice-pro

    Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

    Python6
  • hacksider/KrillinAI

    A video translation and dubbing tool powered by LLMs, offering professional-grade translations and one-click full-process deployment. It can generate content optimized for platforms like YouTube,TikTok, and Shorts. 基于AI大模型的视频翻译和配音工具,专业级翻译,一键部署全流程,可以生成适配抖音,小红书,哔哩哔哩,视频号,TikTok,Youtube Shorts等形态的内容

    Go6
  • hacksider/Wan2GP

    Wan 2.1 for the GPU Poor

    Python6
  • hacksider/Zonos

    Zonos-v0.1 is a leading open-weight text-to-speech model trained on more than 200k hours of varied multilingual speech, delivering expressiveness and quality on par with—or even surpassing—top TTS providers.

    Python6
  • hacksider/Thin-plate-spline-motion-model-ONNX-Faceswap

    Thin Plate Spline Motion Model - ONNX. Extended version for FaceSwap - HeadSwap - PartSwap

    Python6
  • hacksider/Rope

    GUI-focused roop

    Python6
  • hacksider/openv0

    AI generated UI components

    TypeScript6
  • hacksider/Voost

    [Official] Voost: A Unified and Scalable Diffusion Transformer for Bidirectional Virtual Try-On and Try-Off

    5
  • hacksider/chatterbox

    SoTA open-source TTS

    Python5
  • hacksider/SurfSense

    Open Source Alternative to NotebookLM / Perplexity / Glean, connected to external sources such as search engines (Tavily, Linkup), Slack, Linear, Notion, YouTube, GitHub and more.

    TypeScript5
  • hacksider/FacePoke

    Select a portrait, click to move the head around (please use your own space / GPU!)

    JavaScript5
  • hacksider/s3_upload_shell

    Simply upload all the files to s3 every day and delete files on the folder every 10 days

    Shell5
  • hacksider/system-design-101

    Explain complex systems using visuals and simple terms. Help you prepare for system design interviews.

    5
  • hacksider/appwrite

    Build like a team of hundreds_

    TypeScript5
  • hacksider/skypilot

    SkyPilot: Run LLMs, AI, and Batch jobs on any cloud. Get maximum savings, highest GPU availability, and managed execution—all with a simple interface.

    Python5
  • hacksider/StableAvatar

    We present StableAvatar, the first end-to-end video diffusion transformer, which synthesizes infinite-length high-quality audio-driven avatar videos without any post-processing, conditioned on a reference image and audio.

    Python4
  • hacksider/ToonComposer

    Streamlining Cartoon Production with Generative Post-Keyframing

    Python4
  • hacksider/ColorFlow

    The official implementation of paper "ColorFlow: Retrieval-Augmented Image Sequence Colorization"

    Python4
  • hacksider/Live_Portrait_Monitor

    Bring portraits to life via Monitor!

    Python4
  • hacksider/Ultimate-Facebook-Scraper

    🤖 A Software that automates your social media interactions to collect posts, photos, videos, interests, friends, followers, and much more on Facebook.

    4
  • hacksider/SimpleMem

    SimpleMem: Efficient Lifelong Memory for LLM Agents

    Python3
  • hacksider/MoCha

    MoCha: End-to-End Video Character Replacement without Structural Guidance

    Python3
  • hacksider/deer-flow

    DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.

    TypeScript3
  • hacksider/STAR

    STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution

    Python3
  • hacksider/MAGI-1

    MAGI-1: Autoregressive Video Generation at Scale

    Python3
  • hacksider/SVFR

    Official implementation of SVFR.

    Python3
  • hacksider/AniPortrait-for-windows

    AniPortrait: Audio-Driven Synthesis of Photorealistic Portrait Animation

    Python3
  • hacksider/magentic-ui

    A research prototype of a human-centered web agent

    Python2
  • hacksider/comfyui-vrgamedevgirl

    Custom ComfyUI nodes for film grain, color matching, and video enhancement.

    Python2
  • hacksider/whispering-ui

    Native UI for the Whispering Tiger project - https://github.com/Sharrnah/whispering (live transcription / translation)

    Go2
  • hacksider/kilocode

    Open Source AI coding assistant for planning, building, and fixing code. We're a superset of Roo, Cline, and our own features. Follow us: kilocode.ai/social

    TypeScript2
  • hacksider/trae-agent

    Trae Agent is an LLM-based agent for general purpose software engineering tasks.

    Python2
  • hacksider/FasterLivePortrait

    Bring portraits to life in Real Time!onnx/tensorrt support!实时肖像驱动!

    Python2
  • hacksider/thera

    Thera: Aliasing-Free Arbitrary-Scale Super-Resolution with Neural Heat Fields

    Python2
  • hacksider/StdGEN

    [CVPR 2025] StdGEN: Semantic-Decomposed 3D Character Generation from Single Images

    Python2
  • hacksider/LHM

    Code of LHM: Large Animatable Human Reconstruction Model for Single Image to 3D in Seconds

    Python2
  • Python2
  • hacksider/echomimic_v2

    EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

    Python2
  • hacksider/paperless-ngx

    A community-supported supercharged version of paperless: scan, index and archive all your physical documents

    Python2
  • hacksider/mimic_head

    Unofficial One-click Version of LivePortrait, with Webcam Support

    Python2
  • hacksider/vid2densepose

    Convert your videos to densepose and use it on MagicAnimate

    Python2
  • hacksider/Synthalingua

    Synthalingua - Real Time Translation

    Python2
  • hacksider/Blur-Detection-Web-App

    Blur Detection Web App with OpenCV and Flask

    Python2
  • hacksider/clamav-rest-api

    ClamAV REST API. Scan files using simple POST request.

    JavaScript2
  • hacksider/stable-diffusion-webui

    Stable Diffusion web UI

    Python2
  • hacksider/ACE-Step-1.5

    The most powerful local music generation model that outperforms most commercial alternatives, supporting Mac, AMD, Intel, and CUDA devices.

    Python1
  • hacksider/comic-translate

    Desktop app for automatically translating comics - BDs, Manga, Manhwa, Fumetti and more in a variety of formats (Image, Pdf, Epub, cbr, cbz, etc) and in multiple languages.

    Python1
  • hacksider/LuxTTS

    A high-quality rapid TTS voice cloning model that reaches speeds of 150x realtime.

    Python1
  • hacksider/echomimic_v3

    [AAAI 2026] EchoMimicV3: 1.3B Parameters are All You Need for Unified Multi-Modal and Multi-Task Human Animation

    Python1
  • hacksider/VOODOO3D-official

    Official implementation for the paper "VOODOO 3D: Volumetric Portrait Disentanglement for One-Shot 3D Head Reenactment"

    Python1
  • hacksider/Stream-DiffVSR

    The official repository of paper "Stream-DiffVSR: Low-Latency Streamable Video Super-Resolution via Auto-Regressive Diffusion"

    Python1
  • hacksider/RealVideo

    A real-time streaming conversational video system that transforms text interactions into continuous, high-fidelity video responses using autoregressive diffusion.

    Python1
  • hacksider/fantasy-portrait

    FantasyPortrait: Enhancing Multi-Character Portrait Animation with Expression-Augmented Diffusion Transformers

    1
  • hacksider/ACE-Step

    ACE-Step: A Step Towards Music Generation Foundation Model

    Python1
  • hacksider/Bagel

    Open-source unified multimodal model

    Python1
  • hacksider/Real-Time-Latent-Consistency-Model

    App showcasing multiple real-time diffusion models pipelines with Diffusers

    Python1
  • hacksider/OmniGen

    OmniGen: Unified Image Generation. https://arxiv.org/pdf/2409.11340

    Jupyter Notebook1
  • hacksider/Screen-Translator

    An Electron.js-based desktop application for automatically translating on-screen text.

    JavaScript1
  • hacksider/mslearn-deploy-run-container-app-service

    Sample code for MS Learn module "Deploy and run a containerized web application with App Service"

    C#1
  • hacksider/fetch-nft

    🖼🎑🌠 A utility to fetch and easily display Ethereum & Solana NFTs in a common format given any wallet

    TypeScript1