- Taiwan
-
11:32
(UTC +08:00)
Highlights
- Pro
Stars
A collection of open datasets for industrial applications, divided by categories
Get your documents ready for gen AI
A collection of prompts, system prompts and LLM instructions
SuperPrompt is an attempt to engineer prompts that might help us understand AI agents.
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS…
PyTorch Implementation of AudioLCM (ACM-MM'24): a efficient and high-quality text-to-audio generation with latent consistency model.
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Build your own Face App with Stable Diffusion 2.1
This repository contains the dataset including the pair of 2D face image and its corresponding 3D face geometry model.
A generative speech model for daily dialogue.
Convert PDF to markdown + JSON quickly with high accuracy
⚡FlashRAG: A Python Toolkit for Efficient RAG Research (WWW2025 Resource)
A Curated List of Awesome Table Structure Recognition (TSR) Research. Including models, papers, datasets and codes. Continuously updating.
超轻量级中文ocr,支持竖排文字识别, 支持ncnn、mnn、tnn推理 ( dbnet(1.8M) + crnn(2.5M) + anglenet(378KB)) 总模型仅4.7M
FinRobot: An Open-Source AI Agent Platform for Financial Analysis using LLMs 🚀 🚀 🚀
This repository contains the code and implementation details of the CascadeTabNet paper "CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents"
Code Repository for CVPR 2023 Paper "PanoHead: Geometry-Aware 3D Full-Head Synthesis in 360 degree"
2nd solution of ICDAR 2021 Competition on Scientific Literature Parsing, Task B.
Convert documents to structured data effortlessly. Unstructured is open-source ETL solution for transforming complex documents into clean, structured formats for language models. Visit our website …
Official Pytorch Implementation of 3DV2021 paper: SAFA: Structure Aware Face Animation.
Python tools for 3D face: 3DMM, Mesh processing(transform, camera, light, render), 3D face representations.
Graph based retrieval + GenAI = Better RAG in production
Custom GPT Showcase, featuring advanced workflows and operational logic.
[ECCV 2024] Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance