Lists (8)
Sort Name ascending (A-Z)
Stars
Official code implementation for the ACL 2025 paper: 'Dynamic Scaling of Unit Tests for Code Reward Modeling'
ZeroSearch: Incentivize the Search Capability of LLMs without Searching
The official repo for "VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search"
General Reasoner: Advancing LLM Reasoning Across All Domains
🔱 Sailor2: Sailing in South-East Asia with Inclusive Multilingual LLMs
Source code for our ICML'25 paper
YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open
generate lyrics, song and background music(instrumental). Model Context Protocol (MCP) server.
Colab adaptation of MVSep Model for MDX23 music separation contest
GUI for a Vocal Remover that uses Deep Neural Networks.
Use API to call the music generation AI of suno.ai, and easily integrate it into agents like GPTs.
A lightweight adjustment tool for smoothing token probabilities in the Qwen models to encourage balanced multilingual generation.
CleverBee - The Open Source Deep Researcher Tool
RM-R1: Unleashing the Reasoning Potential of Reward Models
Paper "Multi-Agent System for Comprehensive Soccer Understanding"
BRIGHT: A Realistic and Challenging Benchmark for Reasoning-Intensive Retrieval
Introducing Filtered Direct Preference Optimization (fDPO) that enhances language model alignment with human preferences by discarding lower-quality samples compared to those generated by the learn…
[ACL 2025 Main] SongComposer: A Large Language Model for Lyric and Melody Generation in Song Composition
Muzic: Music Understanding and Generation with Artificial Intelligence
huggingface / yourbench
Forked from sumukshashidhar/yourbench🤗 Benchmark Large Language Models Reliably On Your Data
FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent, VSCode Agent, Dia Browser & Trae AI (And other Open Sourced) System Prompts, Tools & AI Models.
We systematically studied the influencing factors when LLM generates benchmarks,By using our code, you can generate high-quality QA datasets
The official repository of our survey paper: "Towards a Unified View of Preference Learning for Large Language Models: A Survey"
[EMNLP-2024] ⚓️ Sailor: Open Language Models for South-East Asia