hulk006 / Starred · GitHub

Open-source and strong foundation image recognition models.

Jupyter Notebook 3,233 302 Updated Feb 18, 2025

Rankings include: Align3R BetterDepth ChronoDepth CUT3R Deep3D Depth Any Video Depth Anything Depth Pro DepthCrafter Geo4D GRIN L4P MASt3R Metric3D Metric-Solver MoGe MonST3R NVDS RollingDepth Ster…

146 1 Updated May 17, 2025

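Python3 package for Chinese/English OCR, with paddleocr-v4 onnx model (~14MB). Inference based on the ppocr-v4-onnx model delivers accurate, millisecond-level OCR prediction on CPU, reaching open-source SOTA for Chinese/English OCR in general scenarios.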

Python 82 11 Updated Jan 22, 2025

Local Deployment of OmniParser v2.0 with pyautogui for True Automated Clicking!

Python 33 8 Updated Feb 19, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 2,954 223 Updated May 16, 2025

A simple screen parsing tool towards pure vision based GUI agent

Jupyter Notebook 22,095 1,853 Updated Mar 26, 2025

Awesome multilingual OCR toolkits based on PaddlePaddle (practical ultra lightweight OCR system, support 80+ languages recognition, provide data annotation and synthesis tools, support training and…

Python 49,180 8,205 Updated May 15, 2025

Texas Poker Multi-Agent Game

Roff 10 2 Updated Mar 11, 2025

FLUX, Stable Diffusion, SDXL, SD3, LoRA, Fine Tuning, DreamBooth, Training, Automatic1111, Forge WebUI, SwarmUI, DeepFake, TTS, Animation, Text To Video, Tutorials, Guides, Lectures, Courses, Comfy…

Jupyter Notebook 2,427 329 Updated May 17, 2025

PE3R: Perception-Efficient 3D Reconstruction. Take 2 - 3 photos with your phone, upload them, wait a few minutes, and then start exploring your 3D world via text!

Python 348 13 Updated Apr 1, 2025

Official implementations for paper: VACE: All-in-One Video Creation and Editing

Python 1,754 82 Updated May 15, 2025

Open-Sora: Democratizing Efficient Video Production for All

Python 26,432 2,554 Updated Apr 30, 2025

The fastest digital human algorithm, now on your desktop.

Python 514 61 Updated Dec 29, 2024

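The simplest reproduction of R1 results on a small model, explaining the most important essence of O1-like models and DeepSeek R1. Think is all you need. Experiments provide evidence that, for strong reasoning ability, the content of the thinking process is the core of AGI/ASI.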

Python 45 7 Updated Feb 8, 2025

Real time interactive streaming digital human

Python 5,589 832 Updated May 18, 2025

[CVPR 2025] EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation

Python 3,740 445 Updated Feb 27, 2025

OpenStereo: A Comprehensive Benchmark for Stereo Matching and Strong Baseline

Python 649 71 Updated May 14, 2025

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

Jupyter Notebook 16,316 1,490 Updated Sep 5, 2024

DINO-X: The World's Top-Performing Vision Model for Open-World Object Detection and Understanding

Python 1,043 39 Updated Apr 21, 2025

The official code for “Recurrent Generic Contour-based Instance Segmentation with Progressive Learning”, TCSVT, 2024.

Python 76 7 Updated Mar 9, 2025

YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]

Python 10,718 1,094 Updated Mar 14, 2025

An introductory tutorial on recommender systems; read online at https://datawhalechina.github.io/fun-rec/

Jupyter Notebook 5,715 918 Updated Feb 22, 2025

A collaboration friendly studio for NeRFs

Python 10,215 1,420 Updated May 6, 2025

OpenMMLab Semantic Segmentation Toolbox and Benchmark.

Jupyter Notebook 1 Updated Jun 12, 2023

Concurrently chat with ChatGPT, Bing Chat, Bard, Alpaca, Vicuna, Claude, ChatGLM, MOSS, 讯飞星火, 文心一言 and more, discover the best answers

JavaScript 15,807 1,674 Updated Apr 20, 2025

Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]

Python 906 49 Updated Jul 6, 2024

Firewall circumvention: accessing the open internet

Kotlin 39,705 7,355 Updated May 13, 2025

A PyTorch implementation of CASENet for the Cityscapes Dataset

Python 85 12 Updated Jul 31, 2019

Semantic Segmentation on PyTorch (includes FCN, PSPNet, Deeplabv3, Deeplabv3+, DANet, DenseASPP, BiSeNet, EncNet, DUNet, ICNet, ENet, OCNet, CCNet, PSANet, CGNet, ESPNet, LEDNet, DFANet)

Python 2,976 586 Updated Jan 4, 2023