vhientran (VHT) / Starred · GitHub
Keep Smiling :)
  • Kyoto, Japan

Starred repositories

Retrieval and Retrieval-augmented LLMs

Python 9,528 693 Updated Apr 15, 2025

Code for paper: [ICLR 2025] Surgical, Cheap, and Flexible: Mitigating False Refusal in Language Models via Single Vector Ablation

Python 2 2 Updated Apr 13, 2025
Ruby 6 Updated Jul 31, 2024

Official code repo for the O'Reilly Book - "Hands-On Large Language Models"

Jupyter Notebook 7,068 1,524 Updated Apr 25, 2025
Python 52 29 Updated Apr 26, 2022
Jupyter Notebook 1 Updated Feb 14, 2025

Log Probabilities Are a Reliable Estimate of Semantic Plausibility in Base and Instruction-Tuned Language Models

Jupyter Notebook 6 Updated Nov 13, 2024

LlamaIndex is the leading framework for building LLM-powered agents over your data.

Python 41,396 5,892 Updated May 3, 2025

Code and data for the Chain-of-Draft (CoD) paper

Python 263 32 Updated Mar 11, 2025

Code for 'LLM2Vec: Large Language Models Are Secretly Powerful Text Encoders'

Python 1,500 121 Updated Jan 24, 2025

Code and Slides

Jupyter Notebook 1,834 552 Updated Mar 16, 2025

A library with extensible implementations of DPO, KTO, PPO, ORPO, and other human-aware loss functions (HALOs).

Python 838 51 Updated May 1, 2025
Python 9 Updated Jun 14, 2024

This repository contains code for the paper Direct Preference Optimization with an Offset (ODPO).

Python 15 2 Updated Feb 17, 2025

Train transformer language models with reinforcement learning.

Python 13,577 1,856 Updated May 3, 2025

Official repository for ORPO

Python 450 43 Updated May 31, 2024

Code of "Improving Machine Translation with Human Feedback: An Exploration of Quality Estimation as a Reward Model"

Python 22 1 Updated Jun 28, 2024

[EMNLP 2024] mDPO: Conditional Preference Optimization for Multimodal Large Language Models.

Python 73 3 Updated Nov 10, 2024

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Jupyter Notebook 1,737 267 Updated Dec 27, 2024

Code for "Preference Tuning For Toxicity Mitigation Generalizes Across Languages." Paper accepted at Findings of EMNLP 2024

Jupyter Notebook 17 Updated Mar 25, 2025
Python 17 6 Updated Aug 15, 2024

[NeurIPS 2024] Goldfish Loss: Mitigating Memorization in Generative LLMs

Python 84 6 Updated Nov 17, 2024

Localizing Memorized Sequences in Language Models

Jupyter Notebook 14 1 Updated Mar 24, 2025

Understanding the interplay between memorization and generalization in neural networks, featuring MAT, a learning algorithm to enhance robustness by mitigating spurious correlations.

Python 40 1 Updated Dec 19, 2024
Python 21 1 Updated Oct 29, 2024

A framework for few-shot evaluation of language models.

Python 8,834 2,355 Updated Apr 29, 2025
Python 29 4 Updated Oct 29, 2024

Evaluation of the Cross-Lingual Knowledge Alignment in LLMs

Python 9 Updated Apr 8, 2024