Stars
Here is some computer interview knowledge
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
how to optimize some algorithm in cuda.
VARGPT: Unified Understanding and Generation in a Visual Autoregressive Multimodal Large Language Model
OpenR: An Open Source Framework for Advanced Reasoning with Large Language Models
SGLang is a fast serving framework for large language models and vision language models.
A curated collection of research papers exploring the utilization of LLMs for graph-related tasks.
本项目旨在分享大模型相关技术原理以及实战经验(大模型工程化、大模型应用落地)
Awesome machine learning for combinatorial optimization papers.
Graphic notes on Gilbert Strang's "Linear Algebra for Everyone"
Awesome Papers About Performing Prompting On Graphs
Accompanied repositories for our paper Graph foundation model
This repository mainly houses accepted papers from CCF-A conferences in recent years, including ICLR, AAAI, IJCAI, NIPS, and ICML, and is used for quick browsing for cutting-edge information.
Unified Efficient Fine-Tuning of 100+ LLMs & VLMs (ACL 2024)
Single Image to 3D using Cross-Domain Diffusion for 3D Generation
TextStarCraft2,a pure language env which support llms play starcraft2
A RWKV management and startup tool, full automation, only 8MB. And provides an interface compatible with the OpenAI API. RWKV is a large language model that is fully open source and available for c…
A curated list of reinforcement learning with human feedback resources (continually updated)
This code is used to generate presentation results in GIF format, not through a web server. Only suitable for MAgent.
为GPT/GLM等LLM大语言模型提供实用化交互接口,特别优化论文阅读/润色/写作体验,模块化设计,支持自定义快捷按钮&函数插件,支持Python和C++等项目剖析&自译解功能,PDF/LaTex论文翻译&总结功能,支持并行问询多种LLM模型,支持chatglm3等本地模型。接入通义千问, deepseekcoder, 讯飞星火, 文心一言, llama2, rwkv, claude2, m…
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
Concise pytorch implements of MARL algorithms, including MAPPO, MADDPG, MATD3, QMIX and VDN.
Multiagent Reinforcement Learning Research Project
PyTorch implementation of Soft Actor-Critic (SAC), Twin Delayed DDPG (TD3), Actor-Critic (AC/A2C), Proximal Policy Optimization (PPO), QT-Opt, PointNet..
Simple (but often Strong) Baselines for POMDPs in PyTorch, ICML 2022