thak123 (Gaurish Thakkar) / Starred · GitHub

Gaurish Thakkar thak123

Researcher at University of Zagreb, FFZG. #NLP

40 followers · 173 following

University of Zagreb
Zagreb
https://orcid.org/0000-0002-8119-5078

Achievements

Starred repositories

yangheng95 / ABSADatasets

Public & Community-shared datasets for Aspect-based sentiment analysis and Text Classification

HTML 229 66 Updated Jul 14, 2025

dzhng / deep-research

An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…

TypeScript 17,012 1,765 Updated Jun 7, 2025

hipstas / audio-labeler

An in-browser app for labeling audio clips at random, using Docker and Flask.

JavaScript 53 7 Updated Aug 28, 2017

deepset-ai / haystack

AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…

Python 21,520 2,264 Updated Jul 14, 2025

unit8co / darts

A python library for user-friendly forecasting and anomaly detection on time series.

Python 8,750 945 Updated Jul 4, 2025

meta-llama / llama-cookbook

Welcome to the Llama Cookbook! This is your go-to guide for building with Llama: getting started with inference, fine-tuning, and RAG. We also show you how to solve end to end problems using Llama mode…

Jupyter Notebook 17,616 2,552 Updated Jul 14, 2025

cisnlp / GlotCC

🕸 GlotCC Dataset and Pipeline -- NeurIPS 2024

Jupyter Notebook 19 Updated Apr 6, 2025

huggingface / speech-to-speech

Speech To Speech: an effort for an open-sourced and modular GPT4-o

Python 4,103 464 Updated Apr 15, 2025

jzhang38 / TinyLlama

The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.

Python 8,641 547 Updated May 3, 2024

nlp-uoregon / mlmm-evaluation

Multilingual Large Language Models Evaluation Benchmark

Python 127 18 Updated Aug 21, 2024

tjunlp-lab / Awesome-LLMs-Evaluation-Papers

The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.

777 55 Updated May 8, 2024

py2many / py2many

Transpiler of Python to many other languages

Python 956 62 Updated Jun 17, 2025

Guitaricet / relora

Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates

Jupyter Notebook 458 39 Updated Apr 21, 2024

evilsocket / cake

Distributed LLM and StableDiffusion inference for mobile, desktop and server.

Rust 2,875 167 Updated Oct 23, 2024

QwenLM / Qwen3

Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.

Shell 22,548 1,522 Updated Jun 26, 2025

karpathy / LLM101n

LLM101n: Let's build a Storyteller

33,987 1,849 Updated Aug 1, 2024

swj0419 / detect-pretrain-code-contamination

Python 76 8 Updated Dec 26, 2023

EleutherAI / gpt-neox

An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries

Python 7,265 1,070 Updated Jul 1, 2025

soulbliss / NLP-conference-compendium

Compendium of the resources available from top NLP conferences.

461 50 Updated Feb 22, 2024

karpathy / llm.c

LLM training in simple, raw C/CUDA

Cuda 27,145 3,123 Updated Jun 26, 2025

uncbiag / Awesome-Foundation-Models

A curated list of foundation models for vision and language tasks

1,049 54 Updated Jun 23, 2025

yuewang-cuhk / awesome-vision-language-pretraining-papers

Recent Advances in Vision and Language PreTrained Models (VL-PTMs)

1,152 104 Updated Aug 19, 2022

BradyFU / Awesome-Multimodal-Large-Language-Models

✨✨Latest Advances on Multimodal Large Language Models

15,807 1,028 Updated Jul 11, 2025

trimstray / the-book-of-secret-knowledge

A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.

177,866 11,094 Updated Nov 19, 2024

huggingface / candle

Minimalist ML framework for Rust

Rust 17,619 1,146 Updated Jul 7, 2025

pemistahl / lingua-py

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,423 46 Updated Jun 11, 2025

jianzhnie / awesome-instruction-datasets

A collection of prompt and instruction datasets for training chat LLMs such as ChatGPT.

685 36 Updated Apr 7, 2024

babylm / evaluation-pipeline-2023

Evaluation pipeline for the BabyLM Challenge 2023.

Python 76 20 Updated Oct 18, 2023

l294265421 / my-llm

All about large language models

51 7 Updated May 29, 2024

louismullie / treat

Natural language processing framework for Ruby.

Ruby 1,370 126 Updated May 16, 2025

Starred topics

Awesome Lists

interpretable-ai

explainable-ai

aspect-term-extraction

label-noise

noisy-labels

noisy-data

unreliable-labels

robust-learning

weak-supervision
