- Xi’an Jiaotong University
- https://asterisci.github.io/
- https://group.iiis.tsinghua.edu.cn/~maks/people.html
Stars
Comprehensive benchmarking of protein-ligand structure prediction methods. (ICML 2024 AI4Science)
High-performance Image Tokenizers for VAR and AR
Lumina-mGPT 2.0: Stand-alone Autoregressive Image Modeling
[ArXiv 2025] WorldMem: Long-term Consistent World Simulation with Memory
Mental image reconstruction from human brain activity
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
[CVPRW 2025] UniToken is an auto-regressive generation model that combines discrete and continuous representations to process visual inputs, making it easy to integrate both visual understanding an…
Second-place solution for the Brain-to-Text Benchmark '24
RoboVerse: Towards a Unified Platform, Dataset and Benchmark for Scalable and Generalizable Robot Learning
Collection of awesome research in brain decoding, including interaction with multi-modalities, theories, and foundation models.
Code for the AAAI 2022 paper "Open Vocabulary Electroencephalography-To-Text Decoding and Zero-shot Sentiment Classification"
Using vision-language models to decode natural image perception from non-invasive brain recordings.
The Large-scale Manipulation Platform for Scalable and Intelligent Embodied Systems
[ICLR 2025] NeuroLM: A Universal Multi-task Foundation Model for Bridging the Gap between Language and EEG Signals
SoFar: Language-Grounded Orientation Bridges Spatial Reasoning and Object Manipulation
SimVQ: Addressing Representation Collapse in Vector Quantized Models with One Linear Layer
Official repo for 【FaceScore: Benchmarking and Enhancing Face Quality in Human Generation】
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, GLM4, Mistral, Yi1.5, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, …
Janus-Series: Unified Multimodal Understanding and Generation Models
[NeurIPS 2024] Official code repository for MSR3D paper
Awesome lists about framework figures in papers
[ICLR 2025] CBraMod: A Criss-Cross Brain Foundation Model for EEG Decoding
Official code repository for the paper 'EEGPT: Pretrained Transformer for Universal and Reliable Representation of EEG Signals' [NIPS 2024].
[ICLR 2024 spotlight] Large Brain Model for Learning Generic Representations with Tremendous EEG Data in BCI
[CVPR 2025] The First Investigation of CoT Reasoning in Image Generation