8000 GitHub - yfqiu-nlp/mfact-summ: Materials for paper "Detecting and Mitigating Hallucinations in Cross-Lingual Transfer for Abstractive Summarisation"
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

Materials for paper "Detecting and Mitigating Hallucinations in Cross-Lingual Transfer for Abstractive Summarisation"

Notifications You must be signed in to change notification settings

yfqiu-nlp/mfact-summ

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

29 Commits
 
 
 
 
 
 

Repository files navigation

Detecting and Mitigating Hallucinations in Multilingual Summarisation

Updates | mFACT | Loss-weighting | Cite | Paper

Code and materials for the paper "Detecting and Mitigating Hallucinations in Multilingual Summarisation".

Please see the detailed instructions for using our mFACT metrics in mfact, and our loss weighing model in loss-weighting.

Updates

  • mFACT metrics for six languages are now avaliable at HuggingFace! (20/05/2023)

Released Mateirals

Here is a quick navigation to all our released materials.

Translated Faithfulness Classification Datasets

We upload our curated multilingual faithfulness classification dataset in google drive.

mFACT Metrics

Languages mFACT
Chinese HF link
Spanish HF link
French HF link
Hindi HF link
Vietnamese HF link
Turkish HF link
English (mFACT-Transfer) HF link

Note: mFACT-Transfer - this is the version we trained on our English faithfulness classification dataset and performed zero-shot transfer for other languages.

Loss-Weighting Summarizer

Please check both our trained language and summarisation adapter in huggingface.

Languages R-1 mFACT mFACT-Transfer Outputs
Chinese 31.25 42.97 36.02 Google drive
Spanish 23.49 23.37 33.11 Google drive
French 27.46 37.11 40.88 Google drive
Hindi 24.97 34.26 26.46 Google drive
Vietnamese 28.04 39.47 38.20 Google drive
Turkish 17.38 37.80 29.20 Google drive

MAD-X Summarizer

Please check both our trained language and summarisation adapter in huggingface.

Languages R-1 mFACT mFACT-Transfer Outputs
Chinese 28.97 34.58 30.62 Google drive
Spanish 22.83 21.24 29.28 Google drive
French 25.80 40.24 43.64 Google drive
Hindi 24.79 25.74 16.89 Google drive
Vietnamese 26.97 34.59 35.21 Google drive
Turkish 17.05 32.15 22.63 Google drive

Citation

@misc{qiu2023detecting,
      title={Detecting and Mitigating Hallucinations in Multilingual Summarisation}, 
      author={Yifu Qiu and Yftah Ziser and Anna Korhonen and Edoardo M. Ponti and Shay B. Cohen},
      year={2023},
      eprint={2305.13632},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}

About

Materials for paper "Detecting and Mitigating Hallucinations in Cross-Lingual Transfer for Abstractive Summarisation"

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published
0