8000 adrian27513 (Adrian Chan) Β· GitHub
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content
View adrian27513's full-sized avatar

Organizations

@Critical-Overload

Block or report adrian27513

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
adrian27513/README.md

Hi! πŸ‘‹ Welcome to My GitHub

About Me

I am an NCSU Master's student specializing in Artificial Intelligence (AI), with a strong interest in Reinforcement Learning (RL) and Computer Vision (CV).
I have completed multiple internships in Computer Vision and am currently working towards a Machine Learning Engineer role.

πŸ“Œ LinkedIn: adrian27513


πŸ”¬ My Research

Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition

Mehreen Saeed, Adrian Chan, Anupam Mijar, Joseph Moukarzel, Georges Habchi, Carlos Younes, Amin Elias, Chau-Wai Wong, Akram Khater

Published at: Advances in Neural Information Processing Systems (NeurIPS), Sep 26, 2024
πŸ“„ arXiv | πŸ’Ύ GitHub | πŸ“‚ Dataset

πŸ” Abstract

We present Muharaf, a dataset of 1,600+ historic handwritten Arabic page images transcribed by experts.
Each document includes polygonal coordinates for text lines and essential page elements.
This dataset enhances handwritten text recognition (HTR), particularly for cursive scripts.
It spans varied handwriting styles and document types, including letters, diaries, legal correspondences, and church records.
We describe the data acquisition pipeline, dataset features, and preliminary CNN-based baseline results.


HATFormer: Historic Handwritten Arabic Text Recognition with Transformers

Adrian Chan, Anupam Mijar, Mehreen Saeed, Chau-Wai Wong, Akram Khater

Under Review at: International Conference for Machine Learning (ICML)
πŸ“„ arXiv | πŸ’Ύ GitHub

πŸ” Abstract

Handwritten Arabic text recognition (HTR) is challenging due to varied writing styles and limited dataset availability.
We propose HATFormer, a transformer-based encoder-decoder model adapted from a state-of-the-art English HTR model.
HATFormer:

  • Leverages attention mechanisms to capture spatial contextual information.
  • Addresses cursive character differentiation, visual decomposition, and diacritic recognition.
  • Uses custom Arabic text tokenization and an optimized image processor for Arabic handwriting.

Performance Highlights:

  • 8.6% Character Error Rate (CER) on the largest historical Arabic HTR dataset (51% improvement over baselines).
  • 4.2% CER on the largest non-historical Arabic HTR dataset.
  • Demonstrates adaptability of English HTR methods to low-resource languages with script-specific challenges.

πŸ› οΈ Tech Stack

🌟 Main Skills

  • Primary: Python, PyTorch, Transformers, Computer Vision, Reinforcement Learning
  • Machine Learning: PyTorch, TensorFlow, OpenCV, Gymnasium, Retrieval-Augmented Generation, YOLO, Tesseract, NumPy, Pandas, Scikit-learn, GPT, LLMs, Transformers, Federated Learning, Vector Databases, Reinforcement Learning

πŸ’» Programming Languages

Python, Java, C, TypeScript/JavaScript, HTML5, SQL, R, MATLAB

πŸš€ Dev Tools & Frameworks

  • Development: Git, Docker, FastAPI, Microservices, Spring Boot, Angular
  • Databases & Cloud: AWS, PostgreSQL, MongoDB, Milvus
  • Testing & API Development: Postman

πŸ“« Get in Touch!

If you're interested in collaborations, research discussions, or just want to connect, feel free to reach out!

πŸ“Œ LinkedIn: adrian27513


Popular repositories Loading

  1. MOBA-AI-Gamer MOBA-AI-Gamer Public

    Python 5

  2. LeagueAI LeagueAI Public

    Forked from Oleffa/LeagueAI

    LeagueAI software framework for League of Legends that provides information about the state of the game based on Image Recognition using OpenCV and Pytorch.

    TeX

  3. LeagueWinrate LeagueWinrate Public

    TypeScript

  4. CSC230Projects CSC230Projects Public

    C

  5. ECE411Project ECE411Project Public

    Python

  6. StartFollowReadWindows StartFollowReadWindows Public

    Python

< 156C /ghcc-consent>
0