-
Purdue University
- West Lafayette, IN, USA
-
14:58
(UTC -04:00) - wenxin-jiang.github.io
Stars
Model Context Protocol Servers
This repository contains the Hugging Face Agents Course.
Windows graphical interface for yt-dlp, designed as a simple YouTube downloader
The OpenSSF CVE Benchmark consists of code and metadata for over 200 real life CVEs, as well as tooling to analyze the vulnerable codebases using a variety of static analysis security testing (SAST…
Tools and standards for conducting and evaluating research in software engineering
A final sanity checklist to help your CS paper get accepted, not desk rejected.
Official inference framework for 1-bit LLMs
A simple code complexity analyser without caring about the C/C++ header files or Java imports, supports most of the popular languages.
find relevant security papers published in the top-4 conferences (S&P, USENIX, CCS, NDSS)
List of Tech Company OAs. Save your time from finding them all over the internet.
机器学习工程师、算法工程师、软件工程师、数据科学家-面试指南 | Interview guide for MLE, SDE, DS
A vector search SQLite extension that runs anywhere!
A SQLite extension for efficient vector search, based on Faiss!
[WIP] Resources for AI engineers. Also contains supporting materials for the book AI Engineering (Chip Huyen, 2025)
🪼 a python library for doing approximate and phonetic matching of strings.
This repository contains LLM (Large language model) interview question asked in top companies like Google, Nvidia , Meta , Microsoft & fortune 500 companies.
Rapid fuzzy string matching in Python using various string metrics
Library for fast text representation and classification.
A minimal specification for purl aka. a package "mostly universal" URL, join the discussion at https://gitter.im/package-url/Lobby
An open-source dataset of malicious software packages found in the wild, 100% vetted by humans.
llama3 implementation one matrix multiplication at a time
Domain name permutation engine for detecting homograph phishing attacks, typo squatting, and brand impersonation
Datasets, tools, and benchmarks for representation learning of code.
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
GitHub Typo Corpus: A Large-Scale Multilingual Dataset of Misspellings and Grammatical Errors