swj0419 (Weijia Shi) / Starred · GitHub

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

539 26 Updated Jul 4, 2025

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,572 330 Updated Jul 3, 2025

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, finetuning, evaluating and serving LLMs in JAX/Flax.

Python 75 16 Updated Aug 17, 2024

Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation

Python 23 1 Updated Feb 18, 2025

Official Repo for Open-Reasoner-Zero

Python 1,982 107 Updated Jun 2, 2025

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 797 75 Updated Mar 14, 2025

Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance

Jupyter Notebook 75 2 Updated Jun 23, 2025

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI

Python 1,165 52 Updated Jun 18, 2025

Generate textbook-quality synthetic LLM pretraining data

Python 501 48 Updated Oct 19, 2023

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)

Python 136 17 Updated May 29, 2025

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 720 556 Updated Jul 4, 2024
Python 2 Updated Jan 24, 2024

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,260 212 Updated Mar 5, 2024

This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Ajith, Mengzhou Xia, Yangsibo Huang, Daogao Liu, Terra Blevins…

Python 226 27 Updated Nov 3, 2023

EcoAssistant: using LLM assistants more affordably and accurately

Python 132 6 Updated Jun 30, 2024

SILO Language Models code repository

Python 81 11 Updated Feb 23, 2024

Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.

Python 205 11 Updated Aug 28, 2024

Failure archive for ChatGPT and similar models

Python 594 23 Updated Apr 7, 2023

Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.

22 1 Updated Nov 23, 2022

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

499 31 Updated Jun 25, 2024

Retrieval augmented diffusion from CompVis.

Jupyter Notebook 53 7 Updated Aug 20, 2022

A modular RL library to fine-tune language models to human preferences

Python 2,322 199 Updated Mar 1, 2024

Code and models for the paper "Questions Are All You Need to Train a Dense Passage Retriever (TACL 2023)"

Python 62 4 Updated Dec 27, 2022

Tevatron - Unified Document Retrieval Toolkit across Scale, Language, and Modality. Demo in SIGIR 2023, SIGIR 2025.

Python 661 109 Updated Jun 12, 2025

Pyserini is a Python toolkit for reproducible information retrieval research with sparse and dense representations.

Python 1,882 420 Updated Jul 1, 2025

PyTorch + HuggingFace code for RetoMaton: "Neuro-Symbolic Language Modeling with Automaton-augmented Retrieval" (ICML 2022), including an implementation of kNN-LM and kNN-MT

Python 275 25 Updated Oct 20, 2022

Parallel data preprocessing for NLP and ML.

Python 34 2 Updated Nov 1, 2024

[EMNLP 2022] Training Language Models with Memory Augmentation https://arxiv.org/abs/2205.12674

Python 196 13 Updated Jun 14, 2023

Must-read papers on prompt-based tuning for pre-trained language models.

4,232 387 Updated Jul 17, 2023