8000 jun297 (Junhyeok Kim) / Starred · GitHub

More Web Proxy on the site http://driver.im/

jun297

Follow

Junhyeok Kim jun297

Follow

Ph.D. Student @ Yonsei University Personal homepage: https://junhyeok.kim

15 followers · 42 following

Achievements

Achievements

Lists (6)

Sort

💻 App

📜 Data collection

toolbox repos to collect data

✨ Inspiration

📚 Study

🛠️Tool

24 repositories

📝Writing

Stars

Liuziyu77 / Visual-RFT

Official repository of 'Visual-RFT: Visual Reinforcement Fine-Tuning' & 'Visual-ARFT: Visual Agentic Reinforcement Fine-Tuning'’

Jupyter Notebook 1,933 82 Updated May 21, 2025

steven-ccq / ViLAMP

[ICML 2025] Official repository for paper "Scaling Video-Language Models to 10K Frames via Hierarchical Differential Distillation"

Python 145 35 Updated May 15, 2025

TIGER-AI-Lab / QuickVideo

Quick Long Video Understanding

Python 39 3 Updated May 25, 2025

Peterande / D-FINE

D-FINE: Redefine Regression Task of DETRs as Fine-grained Distribution Refinement [ICLR 2025 Spotlight]

Python 2,366 210 Updated Apr 11, 2025

huggingface / large_language_model_training_playbook

An open collection of implementation tips, tricks and resources for training large language models

Python 473 23 Updated Mar 8, 2023

geohot / fromthetransistor

From the Transistor to the Web Browser, a rough outline for a 12 week course

6,207 484 Updated Oct 12, 2021

NVlabs / describe-anything

Implementation for Describe Anything: Detailed Localized Image and Video Captioning

Python 1,126 59 Updated May 6, 2025

si0wang / ThinkLite-VL

Python 79 5 Updated May 6, 2025

tanishqkumar / beyond-nanogpt

Minimal and annotated implementations of key ideas from modern deep learning research.

Jupyter Notebook 669 61 Updated Jun 2, 2025

RooCodeInc / Roo-Code

Forked from cline/cline

Roo Code (prev. Roo Cline) gives you a whole dev team of AI agents in your code editor.

TypeScript 14,875 1,541 Updated Jun 2, 2025

om-ai-lab / VLM-R1

Solve Visual Understanding with Reinforced VLMs

Python 5,042 309 Updated May 11, 2025

om-ai-lab / OVDEval

A Comprehensive Evaluation Benchmark for Open-Vocabulary Detection (AAAI 2024)

Python 50 3 Updated May 7, 2024

zolrath / obsidian-auto-link-title

Automatically fetch the titles of pasted links

TypeScript 583 69 Updated Dec 15, 2024

darlal / obsidian-switcher-plus

Enhanced Quick Switcher plugin for Obsidian.md

TypeScript 503 13 Updated May 17, 2025

alexanderswerdlow / unidisc

UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.

Python 105 5 Updated Apr 2, 2025

md-mohaiminul / BIMBA

Python 14 3 Updated Apr 8, 2025

neuroailab / Opt_CWM

Official PyTorch Implementation of Opt-CWM: Self-Supervised Learning of Motion Concepts by Optimizing Counterfactuals.

Python 19 1 Updated Mar 27, 2025

itailang / poster_guide

A conference poster format with structure, content, creation, and presentation recommendations.

60 6 Updated Feb 16, 2025

NVlabs / PS3

Scaling Vision Pre-Training to 4K Resolution

162 7 Updated May 31, 2025

saccharomycetes / mllms_know

[ICLR'25] Official code for the paper 'MLLMs Know Where to Look: Training-free Perception of Small Visual Details with Multimodal LLMs'

Python 203 9 Updated Apr 20, 2025

ByungKwanLee / DeepSick-R1

Reproduction of DeepSeek-R1

Python 231 23 Updated Apr 14, 2025

DoubtedSteam / MM-GCoT

The official implement of "Grounded Chain-of-Thought for Multimodal Large Language Models"

Python 11 1 Updated Mar 21, 2025

IDEA-Research / GroundingDINO

[ECCV 2024] Official implementation of the paper "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"

Python 8,154 822 Updated Aug 12, 2024

brentyi / tyro

CLI interfaces & config objects, from types

Python 645 32 Updated Jun 1, 2025

jmhessel / mmc4

Forked from allenai/mmc4

MultimodalC4 is a multimodal extension of c4 that interleaves millions of images with text.

Python 1 Updated Mar 15, 2025

cornstarch-org / Cornstarch

Python 93 5 Updated May 27, 2025

oven-sh / bun

Incredibly fast JavaScript runtime, bundler, test runner, and package manager – all in one

Zig 78,379 3,123 Updated Jun 2, 2025

ghostty-org / ghostty

👻 Ghostty is a fast, feature-rich, and cross-platform terminal emulator that uses platform-native UI and GPU acceleration.

Zig 31,126 839 Updated Jun 2, 2025

JamesCXH / research-ideas

Jupyter Notebook 6 Updated May 30, 2025

video-db / StreamRAG

Video Search and Streaming Agent 🕵️‍♂️

Python 469 31 Updated Jan 31, 2024

0