8000 shruti-singh (Shruti Singh) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View shruti-singh's full-sized avatar

Highlights

  • Pro

Block or report shruti-singh

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 17,374 1,716 Updated Jun 9, 2025

Fast lexical search implementing BM25 in Python using Numpy, Numba and Scipy

Python 1,200 68 Updated Jun 3, 2025

Large medical text dataset curated for abbreviation disambiguation, designed for natural language understanding pre-training in the medical domain

Python 271 44 Updated Oct 18, 2023

Benchmark for Brain Computer Interface methods

Python 16 7 Updated Feb 1, 2025

A Comprehensive Survey of Scientific Large Language Models and Their Applications in Scientific Discovery (EMNLP'24)

579 32 Updated Feb 26, 2025

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step

Jupyter Notebook 50,908 7,414 Updated Apr 20, 2025

Code/data for MARG (multi-agent review generation)

Python 44 5 Updated Nov 14, 2024

DSIR large-scale data selection framework for language model training

Python 250 19 Updated Apr 7, 2024

High accuracy RAG for answering questions from scientific documents with citations

Python 7,463 741 Updated May 28, 2025

Pre-training Multi-task Contrastive Learning Models for Scientific Literature Understanding (Findings of EMNLP'23)

Python 11 Updated Aug 24, 2024

Turn expensive prompts into cheap fine-tuned models

TypeScript 2,602 145 Updated May 25, 2024

[EMNLP'23, ACL'24] To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.

Python 5,150 295 Updated Mar 11, 2025

Minimal Python library to connect to LLMs (OpenAI, Anthropic, Google, Groq, Reka, Together, AI21, Cohere, Aleph Alpha, HuggingfaceHub), with a built-in model performance benchmark.

Python 776 52 Updated May 22, 2025

Minimal Python library to connect to LLMs (OpenAI, Anthropic, AI21, Cohere, Aleph Alpha, HuggingfaceHub, Google PaLM2, with a built-in model performance benchmark.

Python 1 Updated Oct 1, 2023

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 42,229 6,044 Updated Jun 10, 2025

A Python library for OpenAlex (openalex.org)

Python 244 32 Updated May 27, 2025

Public space for the user community of Semantic Scholar APIs to share scripts, report issues, and make suggestions.

229 34 Updated Jan 24, 2025

Examples and guides for using the OpenAI API

MDX 64,559 10,619 Updated Jun 10, 2025

Data and tools for generating and inspecting OLMo pre-training data.

Python 1,236 143 Updated Jun 5, 2025

Pretraining Efficiently on S2ORC!

Python 164 5 Updated Oct 23, 2024

Aligned, Review-Informed Edits of Scientific Papers

Python 52 1 Updated Jul 5, 2023

🦜🔗 Build context-aware reasoning applications

Jupyter Notebook 109,181 17,760 Updated Jun 10, 2025

🦙 Integrating LLMs into structured NLP pipelines

Python 1,259 99 Updated Jan 8, 2025

Open source codebase powering the HuggingChat app

TypeScript 8,826 1,332 Updated Jun 10, 2025

⚡ Automating scientific workflows with AI ⚡

Python 386 39 Updated Aug 15, 2024

Awesome-LLM: a curated list of Large Language Model

23,720 1,990 Updated May 9, 2025

OpenAlpaca: A Fully Open-Source Instruction-Following Model Based On OpenLLaMA

Python 302 34 Updated Jun 13, 2023
Python 91 8 Updated May 14, 2024
Next
0