-
University of Zagreb
- Zagreb
-
02:31
(UTC +02:00) - https://orcid.org/0000-0002-8119-5078
Starred repositories
Public & Community-shared datasets for Aspect-based sentiment analysis and Text Classification
An AI-powered research assistant that performs iterative, deep research on any topic by combining search engines, web scraping, and large language models. The goal of this repo is to provide the si…
An in-browser app for labeling audio clips at random, using Docker and Flask.
AI orchestration framework to build customizable, production-ready LLM applications. Connect components (models, vector DBs, file converters) to pipelines or agents that can interact with your data…
A python library for user-friendly forecasting and anomaly detection on time series.
Welcome to the Llama Cookbook! This is your go to guide for Building with Llama: Getting started with Inference, Fine-Tuning, RAG. We also show you how to solve end to end problems using Llama mode…
🕸 GlotCC Dataset and Pipline -- NeurIPS 2024
Speech To Speech: an effort for an open-sourced and modular GPT4-o
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
Multilingual Large Language Models Evaluation Benchmark
The papers are organized according to our survey: Evaluating Large Language Models: A Comprehensive Survey.
Official code for ReLoRA from the paper Stack More Layers Differently: High-Rank Training Through Low-Rank Updates
Distributed LLM and StableDiffusion inference for mobile, desktop and server.
Qwen3 is the large language model series developed by Qwen team, Alibaba Cloud.
An implementation of model parallel autoregressive transformers on GPUs, based on the Megatron and DeepSpeed libraries
Compendium of the resources available from top NLP conferences.
A curated list of foundation models for vision and language tasks
Recent Advances in Vision and Language PreTrained Models (VL-PTMs)
✨✨Latest Advances on Multimodal Large Language Models
A collection of inspiring lists, manuals, cheatsheets, blogs, hacks, one-liners, cli/web tools and more.
The most accurate natural language detection library for Python, suitable for short text and mixed-language text
A collection of awesome-prompt-datasets, awesome-instruction-dataset, to train ChatLLM such as chatgpt 收录各种各样的指令数据集, 用于训练 ChatLLM 模型。
Evaluation pipeline for the BabyLM Challenge 2023.
Natural language processing framework for Ruby.