Highlights
- Pro
Stars
Use PEFT or Full-parameter to CPT/SFT/DPO/GRPO 500+ LLMs (Qwen3, Qwen3-MoE, Llama4, InternLM3, DeepSeek-R1, ...) and 200+ MLLMs (Qwen2.5-VL, Qwen2.5-Omni, Qwen2-Audio, Ovis2, InternVL3, Llava, GLM4…
Sharingan: A Transformer Architecture for Multi-Person Gaze Following
Gaze-LLE: Gaze Target Estimation via Large-Scale Learned Encoders (CVPR 2025, Highlight)
Official code of "ViTGaze: Gaze Following with Interaction Features in Vision Transformers"
推荐系统入门教程,在线阅读地址:https://datawhalechina.github.io/fun-rec/
天池大赛——新闻推荐场景下的用户行为预测挑战赛,SOLO赛,B榜排名5/5338
This repo provides the training and testing code for our paper "A Modular Multimodal Architecture for Gaze Target Prediction: Application to Privacy-Sensitive Settings" published at the GAZE worksh…
FineDiving: A Fine-grained Dataset for Procedure-aware Action Quality Assessment
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
Offical implemention of the paper DiffSal: Joint Audio and Video Learning for Diffusion Saliency Prediction
Code implementation for the paper "Patch-level Gaze Distribution Prediction for Gaze Following"
Unofficial implementation of "SODA: Bottleneck Diffusion Models for Representation Learning"
High-Resolution Image Synthesis with Latent Diffusion Models
High-Resolution Image Synthesis with Latent Diffusion Models
Official PyTorch implementation of ODISE: Open-Vocabulary Panoptic Segmentation with Text-to-Image Diffusion Models [CVPR 2023 Highlight]
Code for Stable Control Representations
PyTorch implementation for DDPM & DDIM
500 行代码实现降噪扩散模型 DDPM,干净无依赖
Daily feed of the latest Computer Vision research papers from https://arxiv.org.
[CVPR2023] NeRF-RPN: A general framework for object detection in NeRFs
The official PyTorch implementation of img2pose: Face Alignment and Detection via 6DoF, Face Pose Estimation - CVPR 2021
Official implementation of FaceXFormer: A Unified Transformer for Facial Analysis