10000 WANGXinyiLinda (Xinyi Wang) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View WANGXinyiLinda's full-sized avatar
🏠
Working from home
🏠
Working from home

Highlights

  • Pro

Block or report WANGXinyiLinda

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

Function Vectors in Large Language Models (ICLR 2024)

Python 167 35 Updated Apr 17, 2025
Python 2,526 307 Updated May 19, 2024

Stanford NLP Python library for understanding and improving PyTorch models via interventions

Python 747 82 Updated Jun 2, 2025

Scalable RL solution for advanced reasoning of language models

Python 1,592 91 Updated Mar 18, 2025

A reading list for papers on causality for natural language processing (NLP)

645 69 Updated May 29, 2025

Interview questions for Computer Science faculty jobs

CSS 40 4 Updated Mar 13, 2024

[EMNLP 2023] MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions

Jupyter Notebook 111 12 Updated Sep 12, 2024

AIOS: AI Agent Operating System

Python 4,206 517 Updated May 21, 2025

This repo is meant to serve as a guide for Machine Learning/AI technical interviews.

Jupyter Notebook 6,345 1,140 Updated May 22, 2025

Machine Learning Interviews from FAANG, Snapchat, LinkedIn. I have offers from Snapchat, Coupang, Stitchfix etc. Blog: mlengineer.io.

10,951 1,771 Updated Aug 31, 2023
Python 4 Updated Aug 27, 2024

Implementation of paper Data Engineering for Scaling Language Models to 128K Context

Python 462 30 Updated Mar 19, 2024

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,179 68 Updated May 26, 2025

Fast, lightweight graphset operation library

C++ 475 40 Updated May 9, 2025

A one-stop library to standardize the inference and evaluation of all the conditional image generation models. (ICLR 2024)

Python 169 18 Updated Apr 16, 2025

Code for ACL2023 paper: Pre-Training to Learn in Context

Python 108 4 Updated Jul 26, 2024

[ACL'24 Oral] Analysing The Impact of Sequence Composition on Language Model Pre-Training

Python 21 5 Updated Aug 18, 2024

[ACL 2022] LinkBERT: A Knowledgeable Language Model 😎 Pretrained with Document Links

Python 437 42 Updated Apr 5, 2022

Source code for TACL paper "KEPLER: A Unified Model for Knowledge Embedding and Pre-trained Language Representation".

Python 201 23 Updated May 3, 2024

Wikidata client library for Python

Python 355 31 Updated Jul 10, 2024

A set of Python scripts for preprocessing the Wikidata JSON dump and running simple queries in an efficient manner.

Python 119 23 Updated Oct 17, 2024

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,210 1,056 Updated May 31, 2025

Retrieval and Retrieval-augmented LLMs

Python 9,825 720 Updated May 28, 2025

Code and Data for "Long-context LLMs Struggle with Long In-context Learning" [TMLR2025]

Python 106 8 Updated Feb 20, 2025

[ACL 2024 (Oral)] A Prospector of Long-Dependency Data for Large Language Models

Python 55 Updated Jul 23, 2024

Code for M4LE: A Multi-Ability Multi-Range Multi-Task Multi-Domain Long-Context Evaluation Benchmark for Large Language Models

Python 23 Updated Jul 27, 2024
Python 35 3 Updated Mar 25, 2024
Next
0