Stars
Plumb a PDF for detailed information about each char, rectangle, line, et cetera — and easily extract text and tables.
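A minimal sketch of the pdfplumber workflow this entry describes; the file name "report.pdf" is a placeholder.

```python
import pdfplumber

# Open a PDF and inspect its first page; "report.pdf" is a placeholder path.
with pdfplumber.open("report.pdf") as pdf:
    page = pdf.pages[0]
    print(page.extract_text())            # page text reconstructed from the chars
    for table in page.extract_tables():   # each table is a list of rows of cell strings
        print(table)
    # Low-level objects the description mentions: chars, rects, lines
    print(len(page.chars), len(page.rects), len(page.lines))
```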
Awesome-LLM-Robustness: a curated list of papers on uncertainty, reliability, and robustness in large language models
We collect papers about large language models (LLMs) for table-related tasks, e.g., using an LLM for Table QA (a curated collection of "tables + LLM" papers).
Fine-tuning & Reinforcement Learning for LLMs. 🦥 Train Qwen3, Llama 4, DeepSeek-R1, Gemma 3, TTS 2x faster with 70% less VRAM.
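A hedged sketch of a typical Unsloth fine-tuning setup; the base model name and LoRA hyperparameters below are illustrative choices, not values taken from this entry.

```python
from unsloth import FastLanguageModel

# Load a 4-bit quantized base model; the model name is an illustrative example.
model, tokenizer = FastLanguageModel.from_pretrained(
    model_name="unsloth/llama-3-8b-bnb-4bit",
    max_seq_length=2048,
    load_in_4bit=True,
)

# Attach LoRA adapters for parameter-efficient fine-tuning.
model = FastLanguageModel.get_peft_model(
    model,
    r=16,
    lora_alpha=16,
    target_modules=["q_proj", "k_proj", "v_proj", "o_proj"],
)
```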
Minimal sharded dataset loaders, decoders, and utils for multi-modal document, image, and text datasets.
A high-performance Python-based I/O system for large (and small) deep learning problems, with strong support for PyTorch.
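A short sketch of the WebDataset loading pattern these two entries describe; the shard URL pattern is a placeholder.

```python
import webdataset as wds

# The shard pattern is a placeholder; WebDataset streams samples straight out of tar shards.
shards = "data/shard-{000000..000009}.tar"

dataset = (
    wds.WebDataset(shards)
    .decode("pil")              # decode image bytes into PIL images
    .to_tuple("jpg", "json")    # pick sample fields by their extension inside the tar
)

for image, metadata in dataset:
    print(image.size, metadata)
    break
```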
We release the UICaption dataset. The dataset consists of UI images (icons and screenshots) and associated text descriptions. This dataset was used to pre-train the Lexi model which provides a gene…
The official implementation of “Sophia: A Scalable Stochastic Second-order Optimizer for Language Model Pre-training”
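A hedged sketch of dropping the Sophia-G optimizer into a PyTorch training step; the import path, the SophiaG class name, and the hyperparameter names are assumptions based on the repository, not verified against it.

```python
import torch
from sophia import SophiaG  # assumed import path from the repository

model = torch.nn.Linear(512, 512)          # stand-in model
# Hyperparameter names and values are assumptions for illustration.
optimizer = SophiaG(model.parameters(), lr=2e-4, betas=(0.965, 0.99),
                    rho=0.01, weight_decay=1e-1)

x = torch.randn(8, 512)
loss = model(x).pow(2).mean()
loss.backward()
optimizer.step()
optimizer.zero_grad()
```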
The Screen Annotation dataset consists of pairs of mobile screenshots and their annotations. The annotations are in text format, and describe the UI elements present on the screen: their type, loca…
A One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
ScreenQA dataset was introduced in the "ScreenQA: Large-Scale Question-Answer Pairs over Mobile App Screenshots" paper. It contains ~86K question-answer pairs collected by human annotators for ~35K…
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web" -- the first dataset and benchmark for developing and evaluating generalist web agents
Mobile App Tasks with Iterative Feedback (MoTIF): Addressing Task Feasibility in Interactive Visual Environments
MiniCPM4: Ultra-Efficient LLMs on End Devices, achieving a 5x+ speedup on typical end-side chips
An open-source framework for training large multimodal models.
The model, data and code for the visual GUI Agent SeeClick
Machine Learning Engineering Open Book
MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.
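A minimal sketch of the DeepSpeed-MII pipeline API for text generation; the model name is a placeholder and this assumes a recent MII release.

```python
import mii

# The model name is a placeholder; mii.pipeline loads it with DeepSpeed inference kernels.
pipe = mii.pipeline("mistralai/Mistral-7B-v0.1")
responses = pipe(["DeepSpeed-MII makes low-latency inference"], max_new_tokens=64)
print(responses)
```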
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
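A minimal sketch of wrapping a model with deepspeed.initialize; the config dict is a tiny illustrative ZeRO-2/fp16 setup, not a recommended configuration.

```python
import torch
import deepspeed

# Stand-in model; a real run would launch this script with the `deepspeed` launcher.
model = torch.nn.Sequential(
    torch.nn.Linear(512, 512), torch.nn.ReLU(), torch.nn.Linear(512, 10)
)

# Illustrative config; real configs usually live in a JSON file.
ds_config = {
    "train_batch_size": 8,
    "fp16": {"enabled": True},
    "zero_optimization": {"stage": 2},
    "optimizer": {"type": "Adam", "params": {"lr": 1e-4}},
}

model_engine, optimizer, _, _ = deepspeed.initialize(
    model=model,
    model_parameters=model.parameters(),
    config=ds_config,
)
```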
The dataset includes widget captions that describe UI elements' functionalities. It is used for training and evaluating the widget captioning model (please see the EMNLP'20 paper: https://arxiv…
It includes two datasets used in downstream tasks for evaluating UIBert: App Similar Element Retrieval data and Visual Item Selection (VIS) data. Both datasets are stored as TFRecords.
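A hedged sketch of reading one of these TFRecord files with tf.data; the file name and feature keys are assumptions for illustration, since the entry does not spell out the schema.

```python
import tensorflow as tf

# File name and feature keys are assumptions; the actual UIBert TFRecords define their own schema.
raw_dataset = tf.data.TFRecordDataset(["visual_item_selection.tfrecord"])

feature_spec = {
    "image/encoded": tf.io.FixedLenFeature([], tf.string),
    "caption": tf.io.FixedLenFeature([], tf.string),
}

def parse(serialized_example):
    # Decode one serialized tf.train.Example into a dict of tensors.
    return tf.io.parse_single_example(serialized_example, feature_spec)

for example in raw_dataset.map(parse).take(1):
    print(example.keys())
```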
Repo for "Monarch Mixer: A Simple Sub-Quadratic GEMM-Based Architecture"
Simple, minimal implementation of the Mamba SSM in one file of PyTorch.
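Since that repo is a single PyTorch file, here is a toy sketch of the discretized state-space recurrence (h_t = Ā·h_{t-1} + B̄·x_t, y_t = C·h_t) that Mamba's selective scan builds on; the shapes and values are made up, and this is not the repo's API.

```python
import torch

# Toy discretized SSM scan: h_t = A_bar * h_{t-1} + B_bar * x_t,  y_t = C . h_t
# Shapes and values are illustrative only.
B, L, D, N = 2, 16, 4, 8                  # batch, sequence length, channels, state size
x = torch.randn(B, L, D)

A_bar = torch.rand(D, N) * 0.9            # per-channel diagonal transition (kept stable)
B_bar = torch.randn(D, N)
C = torch.randn(D, N)

h = torch.zeros(B, D, N)
outputs = []
for t in range(L):
    h = A_bar * h + B_bar * x[:, t, :, None]   # update the hidden state per channel
    outputs.append((h * C).sum(-1))            # project the state back to channel space
y = torch.stack(outputs, dim=1)                # (B, L, D)
print(y.shape)
```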
Implementation of DocFormer: End-to-End Transformer for Document Understanding, a multi-modal transformer based architecture for the task of Visual Document Understanding (VDU)
OCR Annotations from Amazon Textract for Industry Documents Library