8000 GitHub - kyawhtetoo134/pdf-summarizer: PDF Summarizer is a Java Spring Boot web application that allows users to upload PDF files and receive concise text summaries. It uses Apache PDFBox to extract text and a custom frequency-based algorithm to identify and display the most relevant sentences.
[go: up one dir, main page]
More Web Proxy on the site http://driver.im/
Skip to content

PDF Summarizer is a Java Spring Boot web application that allows users to upload PDF files and receive concise text summaries. It uses Apache PDFBox to extract text and a custom frequency-based algorithm to identify and display the most relevant sentences.

Notifications You must be signed in to change notification settings

kyawhtetoo134/pdf-summarizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 

Repository files navigation

PDF Summarizer 📄✨

PDF Summarizer

PDF Summarizer is a Java Spring Boot web application that enables users to upload PDF files and receive concise text summaries. It employs Apache PDFBox to extract text and a custom frequency-based algorithm to identify and display the most relevant sentences.

Table of Contents

Features

  • User-Friendly Interface: Simple and intuitive design for easy navigation.
  • PDF Upload: Easily upload PDF files for summarization.
  • Text Extraction: Utilizes Apache PDFBox for accurate text extraction.
  • Summarization Algorithm: Implements a custom frequency-based algorithm for summarizing text.
  • Downl 9A45 oad Summaries: Download the summarized text for further use.
  • Web-Based: Accessible from any device with a web browser.

Technologies Used

  • Java: The core programming language for backend development.
  • Spring Boot: Framework used to create the web application.
  • Apache PDFBox: Library for PDF content extraction.
  • HTML/CSS/JavaScript: Technologies used for the frontend.
  • Thymeleaf: Template engine for rendering web pages.

Installation

To get started with PDF Summarizer, follow these steps:

  1. Clone the Repository:

    git clone https://github.com/kyawhtetoo134/pdf-summarizer.git
  2. Navigate to the Project Directory:

    cd pdf-summarizer
  3. Build the Project: Use Maven to build the project.

    mvn clean install
  4. Run the Application: Start the application using the following command:

    mvn spring-boot:run
  5. Access the Application: Open your web browser and go to http://localhost:8080.

Usage

  1. Upload a PDF: Click on the upload button to select a PDF file from your device.
  2. Generate Summary: After the file is uploaded, click on the summarize button.
  3. View Summary: The application will display the summarized text.
  4. Download Summary: Click the download button to save the summary as a text file.

How It Works

The PDF Summarizer uses a two-step process to generate summaries:

  1. Text Extraction: The application utilizes Apache PDFBox to extract text from the uploaded PDF file. This ensures that the text is accurately retrieved for summarization.

  2. Summarization: A custom frequency-based algorithm analyzes the extracted text. It identifies the most relevant sentences based on their frequency and context within the document. The algorithm ranks sentences and presents the top results as a summary.

Flow Diagram

Flow Diagram

Contributing

We welcome contributions from the community! If you would like to contribute, please follow these steps:

  1. Fork the Repository: Click the fork button on the top right of the repository page.
  2. Create a Branch: Create a new branch for your feature or bug fix.
    git checkout -b feature/YourFeatureName
  3. Make Changes: Implement your changes and commit them.
    git commit -m "Add your message here"
  4. Push Changes: Push your changes to your forked repository.
    git push origin feature/YourFeatureName
  5. Create a Pull Request: Go to the original repository and create a pull request.

License

This project is licensed under the MIT License. See the LICENSE file for details.

Contact

For any inquiries, please contact:

Releases

To download the latest version of the PDF Summarizer, visit the Releases section. You can find the latest builds and execute them as needed.

For more details, you can also check the Releases section.

Topics

This repository covers various topics related to automated text summarization, including:

  • Automated Text Summarization System
  • Document Summarizer Using Spring Boot
  • Extractive Summarization of PDF Documents
  • Java-Based PDF Summary Generator
  • Natural Language Processing for PDF Summarization
  • PDF Content Extraction and Summarization
  • PDF File Summarizer
  • Sentence Scoring Based Summarizer
  • Text Summarization from PDF Files
  • Web-Based PDF Summarizer Using Java

Thank you for checking out PDF Summarizer! We hope you find it useful for your document summarization needs.

About

PDF Summarizer is a Java Spring Boot web application that allows users to upload PDF files and receive concise text summaries. It uses Apache PDFBox to extract text and a custom frequency-based algorithm to identify and display the most relevant sentences.

Topics

Resources

Stars

Watchers

Forks

Packages

No packages published

Contributors 2

  •  
  •  
0