-
NAVER Cloud, Foundation Research Team
- Seoul, Korea
- https://goddoe.github.io
Lists (3)
Sort Name ascending (A-Z)
Stars
- All languages
- BibTeX Style
- C
- C#
- C++
- CSS
- Cuda
- Cython
- Dart
- Go
- HTML
- Haskell
- Java
- JavaScript
- Jsonnet
- Jupyter Notebook
- Kotlin
- Lua
- MATLAB
- MDX
- MLIR
- Markdown
- Nim
- OCaml
- Objective-C
- OpenEdge ABL
- PHP
- Perl
- Pug
- Python
- QML
- Rich Text Format
- Roff
- Ruby
- Rust
- SCSS
- Scala
- Shell
- Starlark
- Swift
- TeX
- TypeScript
- Vim Script
- Vue
DeerFlow is a community-driven Deep Research framework, combining language models with tools like web search, crawling, and Python execution, while contributing back to the open-source community.
A Flexible Framework for Experiencing Cutting-edge LLM Inference Optimizations
Chat with your database or your datalake (SQL, CSV, parquet). PandasAI makes data analysis conversational using LLMs and RAG.
Kimi-Audio, an open-source audio foundation model excelling in audio understanding, generation, and conversation
A scalable asynchronous reinforcement learning implementation with in-flight weight updates.
Implementing DeepSeek R1's GRPO algorithm from scratch
Sim Studio is an open-source AI agent workflow builder. Sim Studio's interface is a lightweight, intuitive way to quickly build and deploy LLMs that connect with your favorite tools.
Lightweight coding agent that runs in your terminal
Textbook on reinforcement learning from human feedback
prime-rl is a codebase for decentralized RL training at scale
verl: Volcano Engine Reinforcement Learning for LLMs
Democratizing Reinforcement Learning for LLMs
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
dataset for lightly supervised training using the librivox audio book recordings. https://librivox.org/.
The python library for real-time communication
Qwen2.5-Omni is an end-to-end multimodal model by Qwen team at Alibaba Cloud, capable of understanding text, audio, vision, video, and performing real-time speech generation.
Unified framework for robot learning built on NVIDIA Isaac Sim
OmniMamba: Efficient and Unified Multimodal Understanding and Generation via State Space Models
An evolving, large-scale and multi-domain ASR corpus for low-resource languages with automated crawling, transcription and refinement
MM-EUREKA: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning
LLaSE-G1: Incentivizing Generalization Capability for LLaMA-based Speech Enhancement
Node Version Manager - POSIX-compliant bash script to manage multiple active node.js versions
[CVPR 2024 Highlight] GenAD: Generalized Predictive Model for Autonomous Driving & Foundation Models in Autonomous System
The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
A Conversational Speech Generation Model
A Neovim plugin to copy text through SSH with OSC52
Local Deep Research is an AI-powered assistant that transforms complex questions into comprehensive, cited reports by conducting iterative analysis using any LLM across diverse knowledge sources in…