Starred repositories
A high-throughput and memory-efficient inference and serving engine for LLMs
🔥 [ICLR 2025] FakeShield: Explainable Image Forgery Detection and Localization via Multi-modal Large Language Models
A powerful tool for creating fine-tuning datasets for LLMs
Data processing for and with foundation models! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷
Label Studio is a multi-type data labeling and annotation tool with standardized output format
Convert PDF to markdown + JSON quickly with high accuracy
Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
🍒 Cherry Studio is a desktop client that supports multiple LLM providers.
Get up and running with Llama 3.3, DeepSeek-R1, Phi-4, Gemma 3, Mistral Small 3.1 and other large language models.
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything
The most powerful and modular diffusion model GUI, API and backend with a graph/nodes interface.
Simplified Chinese version of ComfyUI
EfficientSAM: Leveraged Masked Image Pretraining for Efficient Segment Anything
The repository provides code for running inference with the SegmentAnything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
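Labeling tool with SAM (Segment Anything Model); supports SAM, SAM2, sam-hq, MobileSAM, EdgeSAM, etc. An interactive, semi-automatic image annotation tool.
IDM Activation & Trial Reset Script
FongMi TV and tvbox configuration files. If you like them, please fork them for your own use. Read the repository instructions carefully before use; by using the files you are deemed to have understood them.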
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
This code corresponds to simulation environments used as part of the MimicGen project.
Code for the ICCV 2021 paper "Pixel Difference Networks for Efficient Edge Detection" (Oral).
🚀🚀🚀 YOLO series of PaddlePaddle implementation, PP-YOLOE+, RT-DETR, YOLOv5, YOLOv6, YOLOv7, YOLOv8, YOLOv10, YOLO11, YOLOX, YOLOv5u, YOLOv7u, YOLOv6Lite, RTMDet and so on. 🚀🚀🚀
YOLOv5🚀 reproduction by Guo Quanhao using PaddlePaddle
The simplest, fastest repository for training/finetuning medium-sized GPTs.
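✯ A directly accessible TV/radio logo library and related tools ✯ 🔕 Permanently free, direct access, fully open source, continuously improved channel logos, IPv4/IPv6 dual-stack access supported 🔕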
This repo includes ChatGPT prompt curation to use ChatGPT and other LLM tools better.
uBlock Origin - An efficient blocker for Chromium and Firefox. Fast and lean.
A collection of live-stream source resources 📺 💯 IPTV, M3U. Wash your hands often, wear a mask, and may everyone stay safe from every virus.