8000 glunce / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View glunce's full-sized avatar

Block or report glunce

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Starred repositories

Showing results
Python 1,368 46 Updated May 23, 2025

The official implement of "VisionReasoner: Unified Visual Perception and Reasoning via Reinforcement Learning"

Python 81 5 Updated May 21, 2025

https://huggingface.co/spaces/csgaobb/AdaptCLIP

39 1 Updated May 20, 2025

New generation of CLIP with fine grained discrimination capability, ICML2025

Python 127 6 Updated May 21, 2025

A powerful tool for creating fine-tuning datasets for LLM

JavaScript 7,383 756 Updated May 22, 2025

Suna - Open Source Generalist AI Agent

TypeScript 12,505 1,773 Updated May 23, 2025

MAGI-1: Autoregressive Video Generation at Scale

Python 3,117 169 Updated May 21, 2025

an AI based quant trading platform

Python 54 3 Updated Apr 16, 2025

A GUI Agent application based on UI-TARS(Vision-Language Model) that allows you to control your computer using natural language.

TypeScript 14,137 1,166 Updated May 23, 2025

Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.

Jupyter Notebook 2,998 228 Updated May 19, 2025

QwQ is the reasoning model series developed by Qwen team, Alibaba Cloud.

Python 495 16 Updated Mar 27, 2025

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 16,298 1,676 Updated Apr 12, 2025

Open source alternative to Gemini Deep Research. Generate reports with AI based on search results.

TypeScript 1,913 186 Updated Mar 15, 2025

Keep searching, reading webpages, reasoning until it finds the answer (or exceeding the token budget)

TypeScript 4,286 391 Updated May 22, 2025

🦉 OWL: Optimized Workforce Learning for General Multi-Agent Assistance in Real-World Task Automation

Python 16,599 1,950 Updated May 23, 2025

No fortress, purely open ground. OpenManus is Coming.

Python 45,987 8,023 Updated May 20, 2025

This is the official repository for Retrieval Augmented Visual Question Answering

Python 227 18 Updated Dec 19, 2024

The official implementation of RAR

Python 87 1 Updated Mar 27, 2024

YOLOE: Real-Time Seeing Anything

Python 1,266 106 Updated May 3, 2025

Solve Visual Understanding with Reinforced VLMs

Python 4,981 308 Updated May 11, 2025

[ACL2025 Findings] Migician: Revealing the Magic of Free-Form Multi-Image Grounding in Multimodal Large Language Models

Python 56 3 Updated May 20, 2025

A CPU Realtime VLM in 500M. Surpassed Moondream2 and SmolVLM. Training from scratch with ease.

Python 198 20 Updated Apr 22, 2025

Automate the process of making money online.

Python 11,484 1,099 Updated Mar 20, 2025

"MiniRAG: Making RAG Simpler with Small and Free Language Models"

Python 1,108 129 Updated May 12, 2025

Agentic-RAG explores advanced Retrieval-Augmented Generation systems enhanced with AI LLM agents.

778 111 Updated Feb 5, 2025

Code for the paper "FinRL-DeepSeek: LLM-Infused Risk-Sensitive Reinforcement Learning for Trading Agents" arXiv:2502.07393

Jupyter Notebook 206 67 Updated Apr 8, 2025
Next
0