Q&A Chatbot for Google Store Reviews

This project is a Q&A tool designed to extract actionable insights from a large dataset of Google Store reviews for a music streaming application, such as Spotify. The tool leverages natural language processing and vectorized databases to provide insightful responses to management queries.

Introduction

This project addresses the challenge of extracting insights from 3.4 million unstructured Google Store reviews. The management of a music streaming application requires insights into what users like, dislike, compare, and suggest about the application. This tool aims to provide an efficient way to extract this information using AI and vectorized storage.

Objectives

The primary objectives of this project are:

Data Preprocessing and Vectorized Database Creation: Preprocess the Google Store reviews dataset and create a vectorized database for efficient information retrieval.
RAG Chain Creation: Develop a Retrieval-Augmented Generation (RAG) chain to retrieve relevant information based on management's queries.
Build a Chatbot UI: Design and implement a user-friendly chatbot interface using Streamlit to allow easy interaction with the Q&A tool.

Dataset Overview

The dataset contains Google Store reviews of a music streaming application. It includes:

Review ID: Unique identifier for each review.
Pseudo Author ID: Anonymized identifier for the author.
Author Name: Name of the reviewer (anonymized).
Review Text: Content of the review.
Review Rating: Numeric rating provided by the user.
Review Likes: Number of likes the review received.
App Version: Version of the application reviewed.
Review Timestamp: Date and time of the review.

The dataset can be accessed through one of these below:

Dataset: Download here
Dataset source: Kaggle - Spotify Google Store Reviews

Features

Question Answering: Answers questions based on user reviews of the music streaming app.
Insights on Competitors: Provides comparisons with other music streaming platforms.
User-Friendly Interface: Streamlit-based UI with chat history and sample queries.
Interactive Typing Animation: Simulates typing for a conversational experience.

Setup and Installation

Frameworks and Libraries

This project utilizes the following frameworks and libraries:

Python 3.12.7
[CUDA](https://developer.nvidia.com/cuda-toolkit) (optional, for GPU support)
OpenAI 1.7.2
- Embedding model: text-embedding-ada-002
- Chat model: gpt-4o-mini
Chroma 0.4.22 (Vector Database)
LangChain 0.1.0
Streamlit 1.40.0

Installation Steps

Clone the repository:

git clone https://github.com/yourusername/qna-chatbot.git
cd qna-chatbot

Install dependencies:
```
pip install -r requirements.txt
```
Set up environment variables by creating a .env file in the root directory and add your OpenAPI API key:
```
OPENAI_API_KEY=your_openai_api_key
```

(Optional) Verify CUDA support:

import torch
print("CUDA available:", torch.cuda.is_available())

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.devcontainer		.devcontainer
assets		assets
README.md		README.md
app.py		app.py
build_rag_chatbot.ipynb		build_rag_chatbot.ipynb
create_vector_store.ipynb		create_vector_store.ipynb
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Q&A Chatbot for Google Store Reviews

Table of Contents

Introduction

Objectives

Dataset Overview

Features

Setup and Installation

Frameworks and Libraries

Installation Steps

Objective 1: Data Preprocessing and Vectorized Database Creation

Steps

Objective 2: RAG Chain Creation

Steps

Objective 3: Build a Chatbot UI

How it Works

Usage

Screenshots

Acknowledgments

About

Uh oh!

Releases

Packages

Uh oh!

Languages

falthackel/qna-chatbot

Folders and files

Latest commit

History

Repository files navigation

Q&A Chatbot for Google Store Reviews

Table of Contents

Introduction

Objectives

Dataset Overview

Features

Setup and Installation

Frameworks and Libraries

Installation Steps

Objective 1: Data Preprocessing and Vectorized Database Creation

Steps

Objective 2: RAG Chain Creation

Steps

Objective 3: Build a Chatbot UI

How it Works

Usage

Screenshots

Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages