8000 scaler2017 (scaler2017) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View scaler2017's full-sized avatar

Block or report scaler2017

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results

Seed1.5-VL, a vision-language foundation model designed to advance general-purpose multimodal understanding and reasoning, achieving state-of-the-art performance on 38 out of 60 public benchmarks.

Jupyter Notebook 1,159 42 Updated May 21, 2025

Open-Qwen2VL: Compute-Efficient Pre-Training of Fully-Open Multimodal LLMs on Academic Resources

Python 218 8 Updated May 17, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 46,512 8,137 Updated Jun 5, 2025

OpenOCR: A general OCR system with accuracy and efficiency. Supporting 24 Scene Text Recognition methods trained from scratch on large-scale real datasets, and will continue to add the latest methods.

Python 655 53 Updated May 21, 2025

Official Pytorch implementations of CDistNet: Perceiving Multi-Domain Character Distance for Robust Text Recognition(IJCV)

Jupyter Notebook 113 18 Updated Aug 10, 2023

CMMLU: Measuring massive multitask language understanding in Chinese

Python 765 63 Updated Dec 6, 2024

A framework for few-shot evaluation of language models.

Python 9,154 2,441 Updated Jun 5, 2025

Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)

Python 51,665 6,246 Updated Jun 5, 2025

Toolkit for linearizing PDFs for LLM datasets/training

Python 12,748 903 Updated Jun 5, 2025

Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.

C++ 3,365 276 Updated Jun 5, 2025

Sky-T1: Train your own O1 preview model within $450

Python 3,263 324 Updated May 18, 2025

The off AA3F icial repo of Qwen (通义千问) chat & pretrained large language model proposed by Alibaba Cloud.

Python 18,436 1,512 Updated Apr 29, 2025

FaceChain is a deep-learning toolchain for generating your Digital-Twin.

Jupyter Notebook 9,431 884 Updated Apr 9, 2025

DeepFaceLab is the leading software for creating deepfakes.

Python 18,122 537 Updated Nov 13, 2024

[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis

Python 1,183 143 Updated Mar 14, 2025

每个人都能用的数字人

Python 1,466 309 Updated May 30, 2025

小报童精选推荐,节省您的时间,推荐最流行的副业项目小报童,AI项目小报童

6 Updated Oct 7, 2024

永久免费开源的 AIGC 课程, 目前已支持Prompt Engineering, ChatGPT, Midjourney, Runway, Stable Diffusion, AI数字人,AI声音&音乐,开源大模型

JavaScript 2,107 181 Updated Apr 13, 2025

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,296 267 Updated Jun 5, 2025

Yi-1.5 is an upgraded version of Yi, delivering stronger performance in coding, math, reasoning, and instruction-following capability.

553 34 Updated Nov 11, 2024

ALIbaba's Collection of Encoder-decoders from MinD (Machine IntelligeNce of Damo) Lab

Python 2,045 301 Updated Mar 19, 2024

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 3,860 569 Updated Apr 24, 2024

DeepSeek-VL2: Mixture-of-Experts Vision-Language Models for Advanced Multimodal Understanding

Python 4,868 1,751 Updated Feb 26, 2025

JARVIS, a system to connect LLMs with ML community. Paper: https://arxiv.org/pdf/2303.17580.pdf

Python 24,172 2,024 Updated Sep 26, 2024

🔮 ChatGPT Desktop Application (Mac, Windows and Linux)

Rust 53,799 6,111 Updated Aug 29, 2024
Next
0