8000 YuChuang1205 (Chuang Yu) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View YuChuang1205's full-sized avatar
🎯
Focusing learning
🎯
Focusing learning

Block or report YuChuang1205

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

PyTorch code for binary segmentation on CelebAMask-HQ dataset via both a UNet written from scratch and a pretrained DeepLabv3 model.

Jupyter Notebook 7 Updated Feb 25, 2021

A large-scale face dataset for face parsing, recognition, generation and editing.

Python 2,211 354 Updated Jun 20, 2024

Collection of awesome medical dataset resources.

936 77 Updated Jan 23, 2025

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 21,223 1,395 Updated May 16, 2025
Python 11,153 994 Updated Apr 16, 2025

The MCP Code Executor is an MCP server that allows LLMs to execute Python code within a specified Conda environment.

JavaScript 75 16 Updated May 13, 2025

Scaling Inference-Time Optimization for Text-to-Image Diffusion Models via Reflection Tuning

Python 136 9 Updated May 12, 2025
Python 19 3 Updated Jun 13, 2024

[NeurIPS 2023]DDCoT: Duty-Distinct Chain-of-Thought Prompting for Multimodal Reasoning in Language Models

Python 43 1 Updated Mar 18, 2024
Python 58 Updated Mar 10, 2025

A Self-Training Framework for Vision-Language Reasoning

Python 78 1 Updated Jan 23, 2025

Official implementation for "Multimodal Chain-of-Thought Reasoning in Language Models" (stay tuned and more will be updated)

Python 3,923 323 Updated Jun 12, 2024

Contrastive Chain-of-Thought Prompting

Python 61 5 Updated Nov 18, 2023

Code for AAAI'24 paper "Graph Neural Prompting with Large Language Models".

Python 22 4 Updated Apr 29, 2025

Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.

Python 2,484 277 Updated May 16, 2025

Qwen2.5-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.

Jupyter Notebook 10,418 746 Updated May 15, 2025

[CVPR2025 Highlight] Insight-V: Exploring Long-Chain Visual Reasoning with Multimodal Large Language Models

Python 190 3 Updated Apr 4, 2025

[NeurIPS 2024] HairDiffusion: Vivid Multi-Colored Hair Editing via Latent Diffusion

4 Updated Nov 13, 2024

😎 A list of awesome scene understanding papers.

763 97 Updated May 14, 2025

Official implementation for "Enhancing Semantics in Multimodal Chain of Thought via Soft Negative Sampling"

Python 12 1 Updated May 21, 2024

Fully open reproduction of DeepSeek-R1

Python 24,437 2,250 Updated May 17, 2025

Lets make video diffusion practical!

Python 13,157 1,118 Updated May 4, 2025

该仓库主要记录 大模型(LLMs) 算法工程师相关的面试题

1,989 142 Updated Dec 26, 2024

[CVPR 2025 Oral] PyTorch re-implementation for Autoregressive Distillation of Diffusion Transformers (ARD).

Python 86 2 Updated Apr 16, 2025

[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型

Python 8,104 613 Updated Apr 27, 2025
Python 74 2 Updated Jun 7, 2024

[CVPR2025 Highlight] PAR: Parallelized Autoregressive Visual Generation. https://yuqingwang1029.github.io/PAR-project

Python 153 1 Updated Mar 20, 2025

This repository provides valuable reference for researchers in the field of multimodality, please start your exploratory travel in RL-based Reasoning MLLMs!

759 35 Updated May 13, 2025

code for "CoMT: A Novel Benchmark for Chain of Multi-modal Thought on Large Vision-Language Models"

Python 12 Updated Mar 10, 2025

A curated list of Awesome Personalized Large Multimodal Models resources

23 Updated May 13, 2025
Next
0