jonathanmutal (Jonathan Mutal) · GitHub

jonathanmutal/README.md

👋 Hi, I'm Jonathan Mutal

I'm a researcher, developer, and PhD candidate at the Faculty of Translation and Interpreting (FTI), University of Geneva 🇨🇭. I specialize in multilingual machine translation for medical communication, with a focus on low-resource languages, human evaluation, and the use of large language models.


🧠 Research Focus

  • 🏥 Machine Translation for Medical Interactions
  • 🌍 Low-Resource and Multilingual NLP
  • 🤖 Large Language Models (LLMs), In-Context Learning
  • 🧾 Human Evaluation & Semantic Metrics
  • 🔍 Instruction Tuning, Domain Adaptation, RAG

My dissertation is titled:
"Evaluating Large Language Models for Low-Resource Multilingual Machine Translation in Medical Interactions"
As part of the PictoDr and BabelDr projects, I evaluate and build translation systems for over 140 language combinations, including translation into pictographs, to support communication between healthcare providers and patients.


🛠️ Tools & Technologies

🧠 Machine Translation & NLP

  • Frameworks: Hugging Face Transformers, MarianNMT, NLLB, Mistral, OpenNMT, PyTorch, Moses
  • Techniques: Multilingual MT, domain adaptation, in-context learning, instruction tuning
  • Evaluation: BLEU, ChrF, COMET, concept-level F1 (UMLS), human evaluation (adequacy, fluency, usability)
  • Retrieval & Representation: BM25, LASER, UMLS-based concept mapping, gloss alignment, GIZA++

📈 Statistical & Evaluation Methods

  • Cumulative Link Mixed Models (CLMM)
  • Fleiss’ Kappa (inter-annotator agreement)
  • Regression, ANOVA, correlation analysis
  • Likert scale design and analysis

🧰 Programming & Libraries

  • Languages: Python, PHP, JavaScript, Bash
  • Core Libraries: PyTorch, NumPy, Pandas, Scikit-learn, Matplotlib, Seaborn
  • NLP & ML: SentencePiece, Tokenizers, Accelerate, datasets
  • Security: bcrypt, Argon2, PHPMailer

🧑‍💻 Software Development & Infrastructure

  • GDPR compliance: consent, encryption, data retention
  • Web & backend dev (PHP, MySQL, custom forms)
  • Git, GitHub Actions, Docker (basic usage)
  • Windows Server R12 and Ubuntu Server

📚 Academic Tools

  • LaTeX (custom Thesis.cls, TikZ, BibTeX)
  • Markdown, Overleaf
  • Reviewing: CLIN32, Languages and Resources in Springer Nature 2024, COLING 2024, AT4SSL, MT Summit 2025

🚀 Recent Highlights

  • Built a multilingual MT training pipeline with multitask validation and biomedical vocabulary adaptation
  • Conducted large-scale human evaluations of medical translation (Arabic, Spanish, Farsi, Albanian, Tigrinya…)
  • Released models and data via Hugging Face

📫 Get in Touch


Thanks for visiting! 👋

Popular repositories

  1. docentes Public

    Building a new module for odoo's framework. Odoo is an all-in-one management software that offers a range of business applications that form a complete suite of enterprise management applications t…

    Python 2 3

  2. dataMining Public

    Cool 1

  3. statistics_course Public

    The course will cover the following topics: Discrete probability, Conditional probability and Bayes’ Rule, Random variables, expectation, variance, and correlation, Common distribution families, Co…

    Jupyter Notebook 1 1

  4. meteorRos Public

    JavaScript

  5. FreeCodeCamp Public

    Forked from freeCodeCamp/freeCodeCamp

    The http://FreeCodeCamp.com open source codebase and curriculum. Learn to code and help nonprofits.

    JavaScript

  6. webtorrent-desktop Public

    Forked from webtorrent/webtorrent-desktop

    🚀 WebTorrent, the streaming torrent client. For OS X, Windows, and Linux.

    JavaScript
