8000 markus583 (Markus F) / Starred · GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View markus583's full-sized avatar

Highlights

  • Pro

Block or report markus583

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A toolkit implementing advanced methods to transfer models and model knowledge across tokenizers.

Python 25 4 Updated May 14, 2025

arXiv LaTeX Cleaner: Easily clean the LaTeX code of your paper to submit to arXiv

Python 6,193 359 Updated Jun 2, 2025

🎛 🔊 A Python library for audio.

C++ 5,564 293 Updated May 20, 2025

Large Concept Models: Language modeling in a sentence representation space

Python 2,223 201 Updated Jan 29, 2025

Train and Infer Powerful Sentence Embeddings with AnglE | 🔥 SOTA on STS and MTEB Leaderboard

Python 545 38 Updated Mar 16, 2025

LaTeX Thesis and Technical Report Template for Johannes Kepler University Linz

TeX 30 10 Updated Jan 16, 2025

A repository of links with advice related to grad school applications, research, phd etc

2,218 214 Updated Nov 12, 2023

Readability-aware automatic lyrics transcription (ALT) evaluation toolkit

Python 41 1 Updated Aug 29, 2024

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,533 123 Updated Jan 24, 2025

The lastest paper about detection of LLM-generated text and code

270 15 Updated May 17, 2025

Benchmarking library for RAG

Jupyter Notebook 208 21 Updated Jun 11, 2025

This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and cont…

Jupyter Notebook 17,496 1,726 Updated Jun 9, 2025

4M: Massively Multimodal Masked Modeling

Python 1,732 107 Updated Jun 2, 2025

Code for Zero-Shot Tokenizer Transfer

Python 129 11 Updated Jan 14, 2025

SemEval2024-task8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection

Python 76 30 Updated Apr 22, 2024

The most accurate natural language detection library for Python, suitable for short text and mixed-language text

Python 1,389 45 Updated Jun 11, 2025

A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.

1,759 231 Updated Oct 16, 2024

Everything you want to know about Google Cloud TPU

Python 529 30 Updated Jul 16, 2024

TensorDict is a pytorch dedicated tensor container.

Python 928 93 Updated Jun 11, 2025

Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.

Python 2,406 187 Updated Jun 4, 2025

Interactively inspect module inputs, outputs, parameters, and gradients.

Python 340 23 Updated May 15, 2025

Toolkit to segment text into sentences or other semantic units in a robust, efficient and adaptable way.

Python 1,064 61 Updated Apr 1, 2025

Friends don't let friends make certain types of data visualization - What are they and why are they bad.

R 6,748 268 Updated Dec 10, 2024

A repo for open resources & information for people to succeed in PhD in CS & career in AI / NLP

922 78 Updated Sep 22, 2024

2024 up-to-date list of DATASETS, CODEBASES and PAPERS on Multi-Task Learning (MTL), from Machine Learning perspective.

757 58 Updated Jun 5, 2025

torchview: visualize pytorch models

Python 946 45 Updated May 18, 2025

An optimized deep prompt tuning strategy comparable to fine-tuning across scales and tasks

Python 2,041 204 Updated Nov 16, 2023

Collection of Tools and Papers related to Adapters / Parameter-Efficient Transfer Learning/ Fine-Tuning

Python 193 11 Updated May 4, 2024

Find data quality issues and clean your data in a single line of code with a Scikit-Learn compatible Transformer.

Python 130 13 Updated Dec 13, 2023

A plug-and-play library for parameter-efficient-tuning (Delta Tuning)

Python 1,028 83 Updated Sep 19, 2024
Next
0