8000 loretoparisi (Loreto Parisi) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View loretoparisi's full-sized avatar
🐍
NightShift
🐍
NightShift

Organizations

@Musixmatchdev @musixmatchresearch

Block or report loretoparisi

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

This is InfiniRetri, a tool enhance Transformer-based LLMs(Large Language Model) ablity to hangle Long-Context.

95 9 Updated Mar 27, 2025
47 2 Updated Feb 17, 2025

Here's all my Python/Numba (CUDA) code for the encoder block I made :)

Python 59 9 Updated Apr 28, 2025

accompanying material for sleep-time compute paper

Python 70 5 Updated Apr 21, 2025
Python 287 16 Updated Apr 18, 2025

21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/

Jupyter Notebook 81,495 41,884 Updated Apr 28, 2025

FULL v0, Cursor, Manus, Same.dev, Lovable, Devin, Replit Agent, Windsurf Agent & VSCode Agent (And other Open Sourced) System Prompts, Tools & AI Models.

37,118 11,290 Updated Apr 27, 2025

User tools for RAPIDS GitHub Actions

Shell 2 18 Updated Apr 28, 2025

🚀 The fast, Pythonic way to build MCP servers and clients

Python 8,091 422 Updated Apr 29, 2025

prima.cpp: Speeding up 70B-scale LLM inference on low-resource everyday home clusters

C++ 757 42 Updated Apr 28, 2025

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,498 121 Updated Jan 24, 2025

Single File, Single GPU, From Scratch, Efficient, Full Parameter Tuning library for "RL for LLMs"

Jupyter Notebook 431 36 Updated Apr 8, 2025

This package contains the original 2012 AlexNet code.

Cuda 2,573 332 Updated Mar 12, 2025

Kyutai with an "eye"

Python 189 25 Updated Mar 26, 2025

An open source implementation of CLIP (With TULIP Support)

Python 132 2 Updated Mar 21, 2025

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

Python 2,963 211 Updated Apr 23, 2025

L1: Controlling How Long A Reasoning Model Thinks With Reinforcement Learning

Python 194 21 Updated Mar 18, 2025

Learn how to use CUA (our Computer Using Agent) via the API on multiple computer environments.

Python 828 191 Updated Apr 24, 2025

Research repository on interfacing LLMs with Weaviate APIs. Inspired by the Berkeley Gorilla LLM.

Jupyter Notebook 123 9 Updated Apr 22, 2025

Deep Research for your internal data

Python 312 33 Updated Apr 28, 2025

Di♪♪Rhythm: Blazingly Fast and Embarrassingly Simple End-to-End Full-Length Song Generation with Latent Diffusion

Python 1,508 149 Updated Apr 20, 2025

An AI Hedge Fund Team

Python 26,727 4,596 Updated Apr 29, 2025

Collection of pretrained models for the Montreal Forced Aligner

Python 144 22 Updated Jul 11, 2024

DeepEP: an efficient expert-parallel communication library

Cuda 7,518 726 Updated Apr 29, 2025

DeepGEMM: clean and efficient FP8 GEMM kernels with fine-grained scaling

Python 5,277 575 Updated Apr 28, 2025

FlashMLA: Efficient MLA decoding kernels

Cuda 11,505 826 Updated Apr 29, 2025

Cost-efficient and pluggable Infrastructure components for GenAI inference

Jupyter Notebook 3,506 333 Updated Apr 30, 2025
Next
0