8000 zwcolin (Zirui Wang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View zwcolin's full-sized avatar
🎃
Hello there
🎃
Hello there

Highlights

  • Pro

Organizations

@ucsd-ets @princeton-nlp @dsc-courses

Block or report zwcolin

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

procedural reasoning datasets

Python 589 55 Updated May 19, 2025
HTML 1 Updated May 18, 2025

A customizable gym environment for maze/gridworld

Jupyter Notebook 6 2 Updated Apr 27, 2018
Shell 2 Updated May 15, 2025

Awesome Reasoning in MLLMs: Papers and Projects about learning to reason with MLLMs, including Chain-of-Thought (CoT), OpenAl o1, and DeepSeek-R1

51 3 Updated Mar 18, 2025

Verifiers for LLM Reinforcement Learning

Python 986 117 Updated May 21, 2025

Witness the aha moment of VLM with less than $3.

Python 3,679 284 Updated May 19, 2025

Random maze environments with different size and complexity for reinforcement learning research.

Python 2 Updated Apr 30, 2024

A customizable framework to create maze and gridworld environments

Python 266 61 Updated Apr 5, 2019

A framework for few-shot evaluation of language models.

Python 8,983 2,403 Updated May 22, 2025

A fork to add multimodal model training to open-r1

Python 1,267 61 Updated Feb 8, 2025

LLaVA-MORE: A Comparative Study of LLMs and Visual Backbones for Enhanced Visual Instruction Tuning

Python 135 8 Updated Apr 24, 2025
Python 53 3 Updated Nov 5, 2024

A collection of materials for CS application

2 Updated Dec 21, 2024
Python 1 Updated Dec 12, 2024
Python 113 15 Updated Jul 14, 2022
HTML 19 3 Updated Nov 26, 2024

qpdf: A content-preserving PDF document transformer

C++ 4,000 311 Updated May 21, 2025

A novel Multimodal Large Language Model (MLLM) architecture, designed to structurally align visual and textual embeddings.

Python 917 57 Updated Mar 25, 2025
Python 16 1 Updated Dec 11, 2024

Open-source evaluation toolkit of large multi-modality models (LMMs), support 220+ LMMs, 80+ benchmarks

Python 2,393 365 Updated May 22, 2025

A curated list of recent and past chart understanding work based on our IEEE TKDE survey paper: From Pixels to Insights: A Survey on Automatic Chart Understanding in the Era of Large Foundation Mod…

202 19 Updated Apr 18, 2025

🔽 Display any CSV (comma separated values) file as a searchable, filterable, pretty HTML table

CSS 1,006 335 Updated Mar 8, 2024

Refine high-quality datasets and visual AI models

Python 9,500 635 Updated May 22, 2025

A project page template for academic papers. Demo at https://eliahuhorwitz.github.io/Academic-project-page-template/

JavaScript 3,101 565 Updated Jan 24, 2025

[NeurIPS 2024] CharXiv: Charting Gaps in Realistic Chart Understanding in Multimodal LLMs

Python 113 12 Updated Apr 22, 2025

Label Studio is a multi-type data labeling and annotation tool with standardized output format

JavaScript 22,164 2,749 Updated May 22, 2025

The AI Datastore for Schemas, BLOBs, and Predictions. Use with your apps or integrate built-in Human Supervision, Data Workflow, and UI Catalog to get the most value out of your AI Data.

Python

1,863 121 Updated Nov 18, 2024

Recent LLM-based CV and related works. Welcome to comment/contribute!

864 38 Updated Mar 8, 2025
Next
0