8000 matteosoo (Matteo Soo) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View matteosoo's full-sized avatar
:octocat:
:octocat:
  • Taipei, Taiwan
  • 18:56 (UTC +08:00)

Highlights

  • Pro

Block or report matteosoo

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A lightweight, low-dependency, unified API to use all common reranking and cross-encoder models.

Python 1,481 92 Updated May 28, 2025

This repository contains LLM (Large language model) interview question asked in top companies like Google, Nvidia , Meta , Microsoft & fortune 500 companies.

1,360 314 Updated Feb 12, 2025

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

Python 8,035 661 Updated Jun 17, 2025

Python tool for converting files and office documents to Markdown.

Python 59,971 3,140 Updated Jun 4, 2025

Max搶票機器人(maxbot) help you quickly buy your tickets

Python 235 119 Updated Jan 12, 2023

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

Python 11,384 1,158 Updated Jul 4, 2025

Memory for AI Agents; Announcing OpenMemory MCP - local and secure memory management.

Python 36,255 3,707 Updated Jul 5, 2025

Enforce the output format (JSON Schema, Regex etc) of a language model

Python 1,832 78 Updated Feb 26, 2025

Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/

TypeScript 8,921 700 Updated May 8, 2025

Implementation for MatMul-free LM.

Python 3,017 189 Updated Nov 5, 2024

D.D.G.S. | Dux Distributed Global Search. A metasearch library that aggregates results from diverse web search services

Python 1,653 167 Updated Jul 6, 2025

Instruct-tune LLaMA on consumer hardware

Jupyter Notebook 18,922 2,225 Updated Jul 29, 2024

Production-ready platform for agentic workflow development.

TypeScript 105,996 16,026 Updated Jul 7, 2025

A developer reference project for creating Retrieval Augmented Generation (RAG) chatbots on Windows using TensorRT-LLM

Python 3,013 407 Updated Apr 2, 2025

Tuning and Evaluation of RAG pipeline. (Automated optimization to be added soon)

Python 264 23 Updated Mar 19, 2024

Use ArXiv ChatGuru to talk to research papers. This app uses LangChain, OpenAI, Streamlit, and Redis as a vector database/semantic cache.

Python 551 73 Updated Apr 15, 2025

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 42,890 6,163 Updated Jul 7, 2025

PyTorch implementation of VALL-E(Zero-Shot Text-To-Speech), Reproduced Demo https://lifeiteng.github.io/valle/index.html

Python 2,147 327 Updated Jun 23, 2025

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. Demo is available in https://plachtaa.github.io/vallex/

Python 7,889 789 Updated Feb 11, 2024

Inference code for CodeLlama models

Python 16,348 1,918 Updated Aug 12, 2024

Chat with your documents on your local device using GPT models. No data leaves your device and 100% private.

Python 20,791 2,300 Updated Mar 2, 2025

SoftVC VITS Singing Voice Conversion

Python 27,339 5,009 Updated Nov 11, 2023

Core Engine of Singing Voice Conversion & Singing Voice Clone

Python 2,806 927 Updated Apr 23, 2024

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

Python 1,199 175 Updated Feb 5, 2024

PITS: Variational Pitch Inference for End-to-end Pitch-controllable TTS without External Pitch Predictor

Python 277 34 Updated Jul 16, 2023

Inference code for Llama models

Python 58,477 9,785 Updated Jan 26, 2025

XPhoneBERT: A Pre-trained Multilingual Model for Phoneme Representations for Text-to-Speech (INTERSPEECH 2023)

Python 331 40 Updated Jul 22, 2024

Official Code for DragGAN (SIGGRAPH 2023)

Python 35,921 3,440 Updated May 18, 2024

This repo is a pipeline of VITS finetuning for fast speaker adaptation TTS, and many-to-many voice conversion

Python 4,939 733 Updated Jan 21, 2025

Large-scale Self-supervised Pre-training Across Tasks, Languages, and Modalities

Python 21,486 2,640 Updated Jul 3, 2025
Next
0