8000 jackie930 (Jackie Liu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View jackie930's full-sized avatar
  • Amazon Web Services

Block or report jackie930

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Solve Visual Understanding with Reinforced VLMs

Python 4,893 304 Updated Apr 21, 2025

Fully open reproduction of DeepSeek-R1

Python 24,347 2,237 Updated May 9, 2025

Official repository of ’Visual-RFT: Visual Reinforcement Fine-Tuning’

Python 1,650 76 Updated Apr 18, 2025
Python 65 1 Updated Mar 11, 2025

An open-source implementaion for fine-tuning Molmo-7B-D and Molmo-7B-O by allenai.

Python 54 5 Updated Apr 25, 2025

LLaVA-Mini is a unified large multimodal model (LMM) that can support the understanding of images, high-resolution images, and videos in an efficient manner.

Python 472 21 Updated Jan 13, 2025

Train transformer language models with reinforcement learning.

Python 13,654 1,870 Updated May 9, 2025

TimesFM (Time Series Foundation Model) is a pretrained time-series foundation model developed by Google Research for time-series forecasting.

Python 4,694 429 Updated Apr 29, 2025

Official implementation for "AutoTimes: Autoregressive Time Series Forecasters via Large Language Models"

Python 193 16 Updated Feb 15, 2025

Code release for "Learning to Generate Explainable Stock Predictions using Self-Reflective Large Language Models" https://arxiv.org/abs/2402.03659

Python 136 30 Updated May 16, 2024

LMDeploy is a toolkit for compressing, deploying, and serving LLMs.

Python 6,320 540 Updated May 8, 2025

Replace OpenAI GPT with another LLM in your app by changing a single line of code. Xinference gives you the freedom to use any LLM you need. With Xinference, you're empowered to run inference with …

Python 7,786 662 Updated May 9, 2025

Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, …

Python 7,454 635 Updated May 10, 2025

DeepSeek-VL: Towards Real-World Vision-Language Understanding

Python 3,820 565 Updated Apr 24, 2024

InternLM-XComposer2.5-OmniLive: A Comprehensive Multimodal System for Long-term Streaming Video and Audio Interactions

Python 2,822 172 Updated Apr 25, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,301 732 Updated May 4, 2025

Universal LLM Deployment Engine with ML Compilation

Python 20,581 1,721 Updated May 1, 2025

An automated pipeline for evaluating LLMs for role-playing.

Python 175 9 Updated Sep 14, 2024

The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use th…

Jupyter Notebook 15,373 1,724 Updated Dec 25, 2024

[NeurIPS 2023] Official implementation of the paper "Segment Everything Everywhere All at Once"

Python 4,580 425 Updated Aug 19, 2024

GLM-4 series: Open Multilingual Multimodal Chat LMs | 开源多语言多模态对话模型

Python 6,529 549 Updated Apr 19, 2025
Python 3,781 356 Updated May 6, 2025

Ultralytics YOLO11 🚀

Python 40,603 7,850 Updated May 10, 2025
Jupyter Notebook 2 1 Updated May 1, 2023

Code & Models for Temporal Segment Networks (TSN) in ECCV 2016

Python 1,557 475 Updated Oct 27, 2020

Nightly release of ControlNet 1.1

Python 4,991 398 Updated Aug 8, 2024

VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)

Python 1 Updated Jun 7, 2024

VILA is a family of state-of-the-art vision language models (VLMs) for diverse multimodal AI tasks across the edge, data center, and cloud.

Python 3,220 264 Updated May 6, 2025
Next
0