Lists (30)
Book
C/C++
Colorization
ComfyUI
Computer Vision
Object Detection / Segmentation / Recognition, Optical Character Recognition (OCR), Vision-Language Models (VLM)
Dataset
Deep Learning
Emacs
Fonts
Games
i3wm
Image/Video Generation
Generative Adversarial Networks (GAN), Autoregressive Models, Diffusion Models (DM), Latent Diffusion Models (LDM)
Image/Video Restoration
Denoising, Super-Resolution, Colorization, Inpainting
Inpainting
Image and Video Inpainting
Language Models
Natural Language Processing (NLP), Large Language Models (LLM)
Mathematics
Media
Metrics
Multimodal Foundation Models
Obsidian
Programming Languages
Python
PyTorch
Rust
Speech and Audio
Stable Diffusion
Transformer
Utils
ViT
Web
Stars
🪐 Markdown with superpowers — from ideas to papers, presentations and books.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
MOVED TO CODEBERG - Web-based environment for live coding algorithmic patterns, incorporating a faithful port of TidalCycles to JavaScript
🤗 Transformers: the model-definition framework for state-of-the-art machine learning models across text, vision, audio, and multimodal tasks, for both inference and training (see the pipeline sketch after this list).
Florence-2 is a novel vision foundation model with a unified, prompt-based representation for a variety of computer vision and vision-language tasks.
This project aims to reproduce Sora (OpenAI's T2V model); we hope the open-source community will contribute to it.
Phantom: Subject-Consistent Video Generation via Cross-Modal Alignment
Chat with private and local large language models
Official implementation of the CVPR 2022 paper "Details or Artifacts: A Locally Discriminative Learning Approach to Realistic Image Super-Resolution"
A Conversational Speech Generation Model
DSPy: The framework for programming, not prompting, language models (see the sketch after this list)
A TTS model capable of generating ultra-realistic dialogue in one pass.
🔥 [ICCV 2025] InfiniteYou: Flexible Photo Recrafting While Preserving Your Identity
Simple, unified interface to multiple Generative AI providers (see the sketch after this list)
YOLO-World + EfficientViT SAM
MiMo: Unlocking the Reasoning Potential of Language Model – From Pretraining to Posttraining
F Lite is a 10B parameter diffusion model created by Freepik and Fal, trained exclusively on copyright-safe and SFW content.
VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.
MAGI-1: Autoregressive Video Generation at Scale
Unofficial implementation of YOLO-World + EfficientSAM for ComfyUI
[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
Cut and paste your surroundings using AR
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model (see the sketch after this list).
ComfyUI Yolo World EfficientSAM custom node
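
As referenced in the 🤗 Transformers entry above, the library exposes single-call inference through its pipeline API. A minimal sketch, assuming the distilbert-base-uncased-finetuned-sst-2-english checkpoint; any text-classification model on the Hub works the same way:

```python
# Minimal Transformers pipeline inference; the checkpoint name is an assumption.
from transformers import pipeline

classifier = pipeline(
    "text-classification",
    model="distilbert-base-uncased-finetuned-sst-2-english",  # assumed Hub checkpoint
)

# Returns a list of {'label': ..., 'score': ...} dicts, one per input string.
print(classifier("Transformers makes model inference a one-liner."))
```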
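For the DSPy entry, a minimal sketch of what "programming, not prompting" looks like; the model string openai/gpt-4o-mini and the configured API key are assumptions:

```python
# Minimal DSPy sketch: declare the task as a signature and let DSPy build the prompt.
import dspy

dspy.configure(lm=dspy.LM("openai/gpt-4o-mini"))  # assumed provider/model string

# "question -> answer" is a declarative signature; ChainOfThought adds reasoning.
qa = dspy.ChainOfThought("question -> answer")
prediction = qa(question="What does SAM stand for in computer vision?")
print(prediction.answer)
```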
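For the aisuite entry ("Simple, unified interface to multiple Generative AI providers"), a minimal sketch of calling different providers behind one chat interface; the provider:model strings and the required API keys are assumptions:

```python
# Minimal aisuite sketch: one OpenAI-style chat call routed to multiple providers.
import aisuite as ai

client = ai.Client()
messages = [{"role": "user", "content": "One sentence on diffusion models."}]

for model in ["openai:gpt-4o", "anthropic:claude-3-5-sonnet-20240620"]:  # assumed models
    response = client.chat.completions.create(model=model, messages=messages)
    print(model, "->", response.choices[0].message.content)
```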
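For the Segment Anything entry, a minimal sketch of point-prompted inference with the segment_anything package; the checkpoint path and the placeholder image are assumptions:

```python
# Minimal SAM inference sketch: load a checkpoint, set an image, prompt with one point.
import numpy as np
from segment_anything import SamPredictor, sam_model_registry

sam = sam_model_registry["vit_h"](checkpoint="sam_vit_h_4b8939.pth")  # assumed local checkpoint
predictor = SamPredictor(sam)

image = np.zeros((512, 512, 3), dtype=np.uint8)  # stand-in for an RGB image (H, W, 3)
predictor.set_image(image)

# One foreground point prompt; SAM returns candidate masks with quality scores.
masks, scores, logits = predictor.predict(
    point_coords=np.array([[256, 256]]),
    point_labels=np.array([1]),
    multimask_output=True,
)
print(masks.shape, scores)
```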