National University Of Singapore
- Singapore
Stars
Strinova Piano Room Assistant (automatically plays the piano with simulated mouse movement)
Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ
Solution of Kaggle competition: WSDM Cup - Multilingual Chatbot Arena
StarVector is a foundation model for SVG generation that transforms vectorization into a code generation task. Using a vision-language modeling architecture, StarVector processes both visual and te…
This repository contains AI models that identify deceptive content and combat misinformation
Collect some World Models for Autonomous Driving (and Robotic) papers.
Experiments on continuous LMs.
A benchmark dataset for evaluating LLMs' SVG editing capabilities
A curated list of early-exiting work (LLM, CV, NLP, etc.)
GeometryCrafter: Consistent Geometry Estimation for Open-world Videos with Diffusion Priors
Enjoy the magic of Diffusion models!
BlackTea-c / Spark-TTS-repro
Forked from SparkAudio/Spark-TTS
Spark-TTS inference code reproduction
HunyuanVideo GP: Large Video Generation Model - GPU Poor version
[ACM MM 2024] "DiffMM: Multi-Modal Diffusion Model for Recommendation"
[ACM MM 2024] Mamba3D: Enhancing Local Features for 3D Point Cloud Analysis via State Space Model
[Paper][ACM MM 2024] Making Large Language Models Perform Better in Knowledge Graph Completion
[ACM MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
[ACM MM 2024] FlashSpeech: Efficient Zero-Shot Speech Synthesis
[ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
Fork of the open-r1 repo. An attempt to reproduce it with a different driver and conda environment.
Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
[CVPR 2025 Highlight] Video Depth Anything: Consistent Depth Estimation for Super-Long Videos
Fully open reproduction of DeepSeek-R1