8000 GitHub - ernestohegi/japanese-text-parser: ⛩ Yochimu, Japanese for 'prophetic dream,' aids in comfortable text comprehension by extracting words, providing definitions, translations, and example sentences from Japanese text files, and facilitating deck creation for learning via SRS tools like Anki.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

⛩ Yochimu, Japanese for 'prophetic dream,' aids in comfortable text comprehension by extracting words, providing definitions, translations, and example sentences from Japanese text files, and facilitating deck creation for learning via SRS tools like Anki.

Notifications You must be signed in to change notification settings

ernestohegi/japanese-text-parser

Repository files navigation

予知夢 Yochimu

Next.js React Node.js Japanese
Express Jest Vercel

🌸 Japanese Text Parser with Contextual Learning 🌸

Learn Japanese vocabulary efficiently through context with this minimalist tool


✨ Introduction

Yochimu (予知夢) means "prophetic dream" in Japanese. This tool helps you read Japanese texts comfortably by parsing the content, providing definitions, and showing example sentences—all with a minimalist Japanese aesthetic.

The goal is to facilitate an in-context learning approach for Japanese vocabulary, allowing you to:

  • 🔍 Search for Japanese words and see their meanings
  • 📝 View example sentences showing words in context
  • 💾 Save selected sentences for later study
  • 📤 Export vocabulary lists as TSV files for Anki flashcards

🌟 Live Demo

Visit https://yochimu.now.sh to try it now!

Or try instant translations by appending a search query:
https://yochimu.now.sh/?search=神様が大好き

📋 Features

  • Japanese text tokenization for accurate parsing
  • Anki integration through TSV export format
  • Context-based learning through example sentences
  • Responsive design for desktop and mobile

🔧 Technology Stack

Frontend

  • Next.js - React framework for server-rendered applications
  • React - UI component library
  • CSS-in-JS - Styled with inline styles and Japanese aesthetics

Backend

  • Node.js - JavaScript runtime
  • Express - Web application framework
  • Kuromoji - Japanese morphological analyzer for tokenization

Development & Testing

  • Jest - JavaScript testing framework
  • Cheerio - Server-side HTML parsing for scraping
  • Vercel - Deployment and hosting platform

🚀 Getting Started

Prerequisites

  • Node.js (v14 or later)
  • pnpm (or npm/yarn)

Installation

  1. Clone the repository:

    git clone https://github.com/ernestohegi/japanese-text-parser.git
    cd japanese-text-parser
  2. Install dependencies:

    pnpm install
  3. Start the development server:

    pnpm dev
  4. Open http://localhost:3000 in your browser

🧪 Using the Parser CLI

Parse Japanese text directly from the command line:

cd server
TEXT="私はペンです。" node index.js

📚 Testing

Run tests with:

pnpm test

🔮 Future Plans

  • Build an integrated SRS (Spaced Repetition System)
  • Add user accounts to save progress
  • Implement more dictionaries and translation sources
  • Add audio pronunciation for vocabulary

📝 Notes

This is an ongoing project (currently v0.1.0). We're still evaluating different dictionaries and translation services for optimal accuracy and performance.

⚙️ Dependencies

This parser relies on Kuromoji for Japanese text tokenization.


頑張ってください!

Made with ❤️ by Ernesto Hegi

About

⛩ Yochimu, Japanese for 'prophetic dream,' aids in comfortable text comprehension by extracting words, providing definitions, translations, and example sentences from Japanese text files, and facilitating deck creation for learning via SRS tools like Anki.

Topics

Resources

Stars

Watchers

Forks

Contributors 4

  •  
  •  
  •  
  •  
0