hulk006

hulk006

0 followers · 15 following

Stars

xinyu1205 / recognize-anything

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,233 302 Updated Feb 18, 2025

AIVFI / Monocular-Depth-Estimation-Rankings-and-2D-to-3D-Video-Conversion-Rankings

Rankings include: Align3R BetterDepth ChronoDepth CUT3R Deep3D Depth Any Video Depth Anything Depth Pro DepthCrafter Geo4D GRIN L4P MASt3R Metric3D Metric-Solver MoGe MonST3R NVDS RollingDepth Ster…

146 1 Updated May 17, 2025

shibing624 / imgocr

Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model(~14MB). 基于ppocr-v4-onnx模型推理，可实现 CPU 上毫秒级的 OCR 精准预测，通用场景中英文OCR达到开源SOTA。

Python 82 11 Updated Jan 22, 2025

nordeim / OmniParser2.0_Pyautogui

Local Deployment of OmniParser v2.0 with pyautogui for True Automated Clicking!

Python 33 8 Updated Feb 19, 2025

QwenLM / Qwen2.5-Omni

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 2,954 223 Updated May 16, 2025

microsoft / OmniParser

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 22,095 1,853 Updated Mar 26, 2025

PaddlePaddle / PaddleOCR

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 49,180 8,205 Updated May 15, 2025

YiCheng996 / TexasHoldemAgent

Texas Poker Multi-Agent Game/多智能体德州扑克游戏

Roff 10 2 Updated Mar 11, 2025

FurkanGozukara / Stable-Diffusion

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…

Jupyter Notebook 2,427 329 Updated May 17, 2025

hujiecpp / PE3R

PE3R: Perception-Efficient 3D Reconstruction. Take 2 - 3 photos with your phone, upload them, wait a few minutes, and then start exploring your 3D world via text!

Python 348 13 Updated Apr 1, 2025

ali-vilab / VACE

Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 1,754 82 Updated May 15, 2025

hpcaitech / Open-Sora

Open-Sora: Democratizing Efficient Video Production for All

Python 26,432 2,554 Updated Apr 30, 2025

kleinlee / MiniMates

The fastest digital human algorithm, now on your desktop.

Python 514 61 Updated Dec 29, 2024

StarRing2022 / R1-Nature

最简易的R1结果在小模型上的复现，阐述类O1与DeepSeek R1最重要的本质。Think is all your need。利用实验佐证，对于强推理能力，think思考过程性内容是AGI/ASI的核心。

Python 45 7 Updated Feb 8, 2025

lipku / LiveTalking

Real time interactive streaming digital human

Python 5,589 832 Updated May 18, 2025

antgroup / echomimic_v2

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 3,740 445 Updated Feb 27, 2025

XiandaGuo / OpenStereo

OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong Baseline

Python 649 71 Updated May 14, 2025

IDEA-Research / Grounded-Segment-Anything

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything

Jupyter Notebook 16,316 1,490 Updated Sep 5, 2024

IDEA-Research / DINO-X-API

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,043 39 Updated Apr 21, 2025

fh2019ustc / PolySnake

The official code for “Recurrent Generic Contour-based Instance Segmentation with Progressive Learning”, TCSVT, 2024.

Python 76 7 Updated Mar 9, 2025

THU-MIG / yolov10

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,718 1,094 Updated Mar 14, 2025

7799

datawhalechina / fun-rec

推荐系统入门教程，在线阅读地址：https://datawhalechina.github.io/fun-rec/

Jupyter Notebook 5,715 918 Updated Feb 22, 2025

nerfstudio-project / nerfstudio

A collaboration friendly studio for NeRFs

Python 10,215 1,420 Updated May 6, 2025

zzubqh / Mask2Former-Simplify

Python 146 16 Updated Dec 6, 2023

hulk006 / mmsegmentation

Forked from open-mmlab/mmsegmentation

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Jupyter Notebook 1 Updated Jun 12, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly