swj0419

Weijia Shi swj0419

https://weijia-shi.netlify.app/

Stars

zhaochen0110 / Awesome_Think_With_Images

Resources and paper list for "Thinking with Images for LVLMs". This repository accompanies our survey on how LVLMs can leverage visual information for complex reasoning, planning, and generation.

539 26 Updated Jul 4, 2025

agentica-project / rllm

Democratizing Reinforcement Learning for LLMs

Jupyter Notebook 3,572 330 Updated Jul 3, 2025

hamishivi / EasyLM

Forked from young-geng/EasyLM

Large language models (LLMs) made easy: EasyLM is a one-stop solution for pre-training, fine-tuning, evaluating, and serving LLMs in JAX/Flax.

Python 75 16 Updated Aug 17, 2024

alvin-zyl / CoLA

Implementation of CoLA: Compute-Efficient Pre-Training of LLMs via Low-Rank Activation

Python 23 1 Updated Feb 18, 2025

Open-Reasoner-Zero / Open-Reasoner-Zero

Official Repo for Open-Reasoner-Zero

Python 1,982 107 Updated Jun 2, 2025

allenai / OLMoE

OLMoE: Open Mixture-of-Experts Language Models

Jupyter Notebook 797 75 Updated Mar 14, 2025

1jsingh / negtome

Official Implementation for paper: Negative Token Merging: Image-based Adversarial Feature Guidance

Jupyter Notebook 75 2 Updated Jun 23, 2025

lucidrains / transfusion-pytorch

PyTorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from Meta AI

Python 1,165 52 Updated Jun 18, 2025

VikParuchuri / textbook_quality

Generate textbook-quality synthetic LLM pretraining data

Python 501 48 Updated Oct 19, 2023

zhichaoxu-shufe / context-aware-decoding-qfs

Python 12 Updated Jan 10, 2024

InfiAgent / InfiAgent

InfiAgent-DABench: Evaluating Agents on Data Analysis Tasks (ICML 2024)

Python 136 17 Updated May 29, 2025

TRI-ML / prismatic-vlms

A flexible and efficient codebase for training visually-conditioned language models (VLMs)

Python 720 556 Updated Jul 4, 2024

minimario / math-retrieval

Python 2 Updated Jan 24, 2024

EvolvingLMMs-Lab / Otter

🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.

Python 3,260 212 Updated Mar 5, 2024

swj0419 / detect-pretrain-code

This repository provides an original implementation of Detecting Pretraining Data from Large Language Models by *Weijia Shi, *Anirudh Ajith, Mengzhou Xia, Yangsibo Huang, Daogao Liu, Terra Blevins…

Python 226 27 Updated Nov 3, 2023

JieyuZ2 / EcoAssistant

EcoAssistant: using LLM assistant more affordably and accurately

Python 132 6 Updated Jun 30, 2024

kernelmachine / silo-lm

SILO Language Models code repository

Python 81 11 Updated Feb 23, 2024

huggingface / OBELICS

Code used for the creation of OBELICS, an open, massive and curated collection of interleaved image-text web documents, containing 141M documents, 115B text tokens and 353M images.

Python 205 11 Updated Aug 28, 2024

giuven95 / chatgpt-failures

Failure archive for ChatGPT and similar models

Python 594 23 Updated Apr 7, 2023

wyu97 / RACo

Resources for Retrieval Augmentation for Commonsense Reasoning: A Unified Approach. EMNLP 2022.

22 1 Updated Nov 23, 2022

suzgunmirac / BIG-Bench-Hard

Challenging BIG-Bench Tasks and Whether Chain-of-Thought Can Solve Them

499 31 Updated Jun 25, 2024

afiaka87 / retrieval-augmented-diffusion

Forked from CompVis/latent-diffusion

Retrieval augmented diffusion from CompVis.

Jupyter Notebook 53 7 Updated Aug 20, 2022

allenai / RL4LMs

DevSinghSachan / art

texttron / tevatron

castorini / pyserini

neulab / knn-transformers

r2llab / wrangl

princeton-nlp / TRIME

thunlp / PromptPapers