This project uses RapidAPI to obtain NCAA game highlights, runs the processing pipeline inside a Docker container, and uses AWS MediaConvert to convert the media files.
The config.py script performs the following actions: Imports necessary environment variables and assigns them to Python variables, providing default values where appropriate. This approach allows for flexible configuration management, enabling different settings for various environments (e.g., development, staging, production) without modifying the source code.
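A minimal sketch of that pattern (the variable names and defaults here are illustrative assumptions, not necessarily the exact ones in the repo):

```python
import os

# Read each setting from the environment, falling back to a default where it
# makes sense, so the same code runs in dev, staging, and production.
RAPIDAPI_KEY = os.getenv("RAPIDAPI_KEY")                 # no safe default: must be provided
AWS_REGION = os.getenv("AWS_REGION", "us-east-1")
S3_BUCKET_NAME = os.getenv("S3_BUCKET_NAME")
MEDIACONVERT_ENDPOINT = os.getenv("MEDIACONVERT_ENDPOINT")
MEDIACONVERT_ROLE_ARN = os.getenv("MEDIACONVERT_ROLE_ARN")
```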
The fetch.py script performs the following actions:
Establishes the date and league that will be used to find highlights. We are using NCAA in this example because it's included in the free version. The script fetches the highlights from the API and stores them in an S3 bucket as a JSON file (basketball_highlight.json).
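A hedged sketch of what that fetch might look like with requests and boto3 (the endpoint path, query parameter names, and S3 key are assumptions based on the description above):

```python
import json
import boto3
import requests
from config import RAPIDAPI_KEY, S3_BUCKET_NAME, AWS_REGION

def fetch_highlights(date="2023-12-01", league_name="NCAA"):
    """Query the Sports Highlights API for one date/league and return the JSON body."""
    url = "https://sport-highlights-api.p.rapidapi.com/basketball/highlights"  # assumed endpoint
    headers = {
        "x-rapidapi-key": RAPIDAPI_KEY,
        "x-rapidapi-host": "sport-highlights-api.p.rapidapi.com",
    }
    params = {"date": date, "leagueName": league_name}  # parameter names are assumptions
    response = requests.get(url, headers=headers, params=params, timeout=30)
    response.raise_for_status()
    return response.json()

def save_to_s3(data):
    """Store the raw API response in the bucket as highlights/basketball_highlight.json."""
    s3 = boto3.client("s3", region_name=AWS_REGION)
    s3.put_object(
        Bucket=S3_BUCKET_NAME,
        Key="highlights/basketball_highlight.json",
        Body=json.dumps(data),
        ContentType="application/json",
    )

if __name__ == "__main__":
    save_to_s3(fetch_highlights())
```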
process_one_video.py performs the following actions:
Connects to the S3 bucket and retrieves the JSON file. Extracts the first video URL from within the JSON file. Downloads the video file from the internet into memory using the requests library. Saves the video as a new file in the S3 bucket under a different folder (videos/). Logs the status of each step.
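A minimal sketch of those steps (the JSON shape, i.e. the "data"/"url" keys, is an assumption about the API response):

```python
import json
import boto3
import requests
from config import S3_BUCKET_NAME, AWS_REGION

def process_one_video():
    s3 = boto3.client("s3", region_name=AWS_REGION)

    # 1. Retrieve the JSON file written by fetch.py
    obj = s3.get_object(Bucket=S3_BUCKET_NAME, Key="highlights/basketball_highlight.json")
    highlights = json.loads(obj["Body"].read())

    # 2. Extract the first video URL (key names are assumptions)
    video_url = highlights["data"][0]["url"]

    # 3. Download the video into memory with requests
    video = requests.get(video_url, timeout=60)
    video.raise_for_status()

    # 4. Save it back to the bucket under the videos/ prefix
    s3.put_object(
        Bucket=S3_BUCKET_NAME,
        Key="videos/first_video.mp4",
        Body=video.content,
        ContentType="video/mp4",
    )

    # 5. Log the status
    print("Uploaded videos/first_video.mp4")

if __name__ == "__main__":
    process_one_video()
```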
mediaconvert_process.py performs the following actions:
Creates and submits a MediaConvert job. Uses MediaConvert to process a video file: configures the video codec, resolution, and bitrate, and also configures the audio settings. Stores the processed video back into an S3 bucket.
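A hedged boto3 sketch of such a job (the codec, resolution, bitrate, and S3 paths are illustrative values, not necessarily the exact settings used in the repo):

```python
import boto3
from config import AWS_REGION, MEDIACONVERT_ENDPOINT, MEDIACONVERT_ROLE_ARN, S3_BUCKET_NAME

def create_job():
    # MediaConvert requires the account-specific endpoint from `aws mediaconvert describe-endpoints`
    client = boto3.client("mediaconvert", region_name=AWS_REGION, endpoint_url=MEDIACONVERT_ENDPOINT)

    settings = {
        "Inputs": [{
            "FileInput": f"s3://{S3_BUCKET_NAME}/videos/first_video.mp4",
            "AudioSelectors": {"Audio Selector 1": {"DefaultSelection": "DEFAULT"}},
            "VideoSelector": {},
        }],
        "OutputGroups": [{
            "Name": "File Group",
            "OutputGroupSettings": {
                "Type": "FILE_GROUP_SETTINGS",
                "FileGroupSettings": {"Destination": f"s3://{S3_BUCKET_NAME}/processed_videos/"},
            },
            "Outputs": [{
                "ContainerSettings": {"Container": "MP4", "Mp4Settings": {}},
                # Video: H.264 at 720p with a capped bitrate (illustrative values)
                "VideoDescription": {
                    "Width": 1280,
                    "Height": 720,
                    "CodecSettings": {
                        "Codec": "H_264",
                        "H264Settings": {
                            "RateControlMode": "QVBR",
                            "MaxBitrate": 5000000,
                            "QvbrSettings": {"QvbrQualityLevel": 8},
                        },
                    },
                },
                # Audio: stereo AAC
                "AudioDescriptions": [{
                    "CodecSettings": {
                        "Codec": "AAC",
                        "AacSettings": {
                            "Bitrate": 96000,
                            "CodingMode": "CODING_MODE_2_0",
                            "SampleRate": 48000,
                        },
                    },
                }],
            }],
        }],
    }

    job = client.create_job(Role=MEDIACONVERT_ROLE_ARN, Settings=settings)
    print("Submitted MediaConvert job:", job["Job"]["Id"])

if __name__ == "__main__":
    create_job()
```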
run_all.py performs the following actions: Runs the scripts in chronological order and provides buffer time for the tasks to be created.
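In essence, something like this minimal sketch (the 30-second buffer is an assumption; tune it to your environment):

```python
import subprocess
import time

# Run each stage in order, pausing between steps so the previous
# step's S3 objects and AWS resources have time to appear.
for script in ["fetch.py", "process_one_video.py", "mediaconvert_process.py"]:
    print(f"Running {script} ...")
    subprocess.run(["python", script], check=True)
    time.sleep(30)  # buffer time between tasks
```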
.env file stores all of the environment variables - values that we don't want to hardcode into our scripts.
Dockerfile performs the following actions: Provides the step-by-step instructions for building the image.
Terraform Scripts: These scripts are used to create resources in AWS in a scalable and repeatable way. All of the resources we work with, such as S3 buckets, IAM roles, Elastic Container Registry (ECR), and Elastic Container Service (ECS), are built here.
Before running the scripts, ensure you have the following:
A rapidapi.com account, which will be needed to access highlight images and videos.
For this example we will be using NCAA (USA College Basketball) highlights since it's included for free in the basic plan.
The Sports Highlights API is the endpoint we will be using.
Docker should be pre-installed in most regions; verify with docker --version
AWS CloudShell has the AWS CLI pre-installed; verify with aws --version
Python3 should also be pre-installed; verify with python3 --version
Copy your AWS Account ID: once logged in to the AWS Management Console, click on your account name in the top right corner to see your account ID. Copy and save it somewhere safe, because you will need it to update the code in the labs later.
You can check whether you have an access key in the IAM dashboard. Under Users, click on a user and then "Security credentials", and scroll down to the Access keys section. You will not be able to retrieve an existing secret access key, so if you don't have it saved somewhere, you will need to create a new access key.
src/
├── Dockerfile
├── config.py
├── fetch.py
├── mediaconvert_process.py
├── process_one_video.py
├── requirements.txt
├── run_all.py
├── .env
├── .gitignore
└── terraform/
    ├── main.tf
    ├── variables.tf
    ├── secrets.tf
    ├── iam.tf
    ├── ecr.tf
    ├── ecs.tf
    ├── s3.tf
    ├── container_definitions.tpl
    └── outputs.tf
git clone https://github.com/alahl1/NCAAGameHighlights.git
cd src
aws secretsmanager create-secret \
--name my-api-key \
--description "API key for accessing the Sport Highlights API" \
--secret-string '{"api_key":"YOUR_ACTUAL_API_KEY"}' \
--region us-east-1
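If a script later needs to read that key back, a minimal boto3 sketch (assuming the secret name and JSON shape from the command above) looks like:

```python
import json
import boto3

# Retrieve the secret created above and pull out the api_key field
secrets = boto3.client("secretsmanager", region_name="us-east-1")
value = secrets.get_secret_value(SecretId="my-api-key")
api_key = json.loads(value["SecretString"])["api_key"]
```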
In the search bar type "IAM"
Click Roles -> Create Role
For the Use Case enter "S3" and click next
Under Add permissions, search for AmazonS3FullAccess, AWSElementalMediaConvertFullAccess, and AmazonEC2ContainerRegistryFullAccess, then click Next
Under Role Details, enter "HighlightProcessorRole" as the name
Select Create Role
Find the role in the list and click on it. Under Trust relationships, edit the trust policy and replace it with this:
{
    "Version": "2012-10-17",
    "Statement": [
        {
            "Effect": "Allow",
            "Principal": {
                "Service": [
                    "ec2.amazonaws.com",
                    "ecs-tasks.amazonaws.com",
                    "mediaconvert.amazonaws.com"
                ],
                "AWS": "arn:aws:iam::<your-account-id>:user/<your-iam-user>"
            },
            "Action": "sts:AssumeRole"
        }
    ]
}
- RapidAPI_KEY: Ensure that you have successfully created the account and selected "Subscribe To Test" in the top left of the Sports Highlights API page
- AWS_ACCESS_KEY_ID=your_aws_access_key_id_here
- AWS_SECRET_ACCESS_KEY=your_aws_secret_access_key_here
- S3_BUCKET_NAME=your_S3_bucket_name_here
- MEDIACONVERT_ENDPOINT=https://your_mediaconvert_endpoint_here.amazonaws.com
aws mediaconvert describe-endpoints
- MEDIACONVERT_ROLE_ARN=arn:aws:iam::your_account_id:role/HighlightProcessorRole
chmod 600 .env
Run:
docker build -t highlight-processor .
Run the Docker Container Locally:
docker run --env-file .env highlight-processor
This will run fetch.py, process_one_video.py and mediaconvert_process.py and the following files should be saved in your S3 bucket:
Optional - Confirm there is a video uploaded to s3://<your-bucket-name>/videos/first_video.mp4
Optional - Confirm there is a processed video uploaded to s3://<your-bucket-name>/processed_videos/
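If you prefer the terminal to the console, a quick boto3 sketch to list what landed under those prefixes (bucket name taken from your .env / config):

```python
import boto3
from config import S3_BUCKET_NAME

s3 = boto3.client("s3")
for prefix in ("videos/", "processed_videos/"):
    listing = s3.list_objects_v2(Bucket=S3_BUCKET_NAME, Prefix=prefix)
    for obj in listing.get("Contents", []):
        print(obj["Key"], obj["Size"])
```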
- Working with Docker and AWS Services
- Identity Access Management (IAM) and least privilege
- How to enhance media quality
- Using Terraform for Infrastructure as Code (IaC)
- Increasing the number of videos processed and converted with AWS MediaConvert
- Changing the date from static (a specific point in time) to dynamic (now, the last 30 days from today's date, etc.)
- In the GitHub repo, there is a resources folder; copy its entire contents.
- In the AWS CloudShell or VS Code terminal, create the file vpc_setup.sh and paste the script inside.
- Run the script
bash vpc_setup.sh
- You will see variables in the output; paste these variables into lines 8-13.
- Store your API key as a SecureString in AWS SSM Parameter Store (see the retrieval sketch after this list)
aws ssm put-parameter \
--name "/myproject/rapidapi_key" \
--value "YOUR_SECRET_KEY" \
--type SecureString
- Run the following script to obtain your mediaconvert_endpoint:
aws mediaconvert describe-endpoints --query "Endpoints[0].Url" --output text
- Leave the mediaconvert_role_arn string empty
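For reference, pulling the key back out of Parameter Store at runtime is a short boto3 call (a sketch, assuming the parameter name used above; Terraform reads the same parameter on its side):

```python
import boto3

# Read the SecureString parameter created above, decrypting it on the way out
ssm = boto3.client("ssm")
param = ssm.get_parameter(Name="/myproject/rapidapi_key", WithDecryption=True)
rapidapi_key = param["Parameter"]["Value"]
```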
Helpful Tip for Beginners:
- Use the same region, project name, S3 bucket name, and ECR repo name to make following along easier. Certain steps, like pushing the Docker image to the ECR repo, are easier to copy and paste without having to remember what you named your repo :)
- From the src folder, navigate to the terraform folder/workspace in VS Code
cd terraform
- Initialize terraform working directory
terraform init
- Check syntax and validity of your Terraform configuration files
terraform validate
- Display execution plan for the terraform configuration
terraform plan
- Apply changes to the desired state
terraform apply -var-file="terraform.dev.tfvars"
- Create an ECR Repo
aws ecr create-repository --repository-name highlight-pipeline
- Log into ECR
aws ecr get-login-password --region us-east-1 | \
docker login --username AWS --password-stdin <AWS_ACCOUNT_ID>.dkr.ecr.us-east-1.amazonaws.com
- Build and Push the Docker Image
docker build -t highlight-pipeline:latest .
docker tag highlight-pipeline:latest <AWS_ACCOUNT_ID>.dkr.ecr.<REGION>.amazonaws.com/highlight-pipeline:latest
docker push <AWS_ACCOUNT_ID>.dkr.ecr.<REGION>.amazonaws.com/highlight-pipeline:latest
- In the AWS CloudShell or VS Code terminal, create the file ncaaprojectcleanup.sh and paste the script from the resources folder inside.
- Run the script
bash ncaaprojectcleanup.sh
- Navigate to the S3 bucket and confirm there is a JSON file in the highlights folder and a video in the videos folder
- Deploying local Docker images to ECR
- A high-level overview of Terraform files
- Networking - VPCs, Internet Gateways, private subnets and public subnets
- SSM for saving secrets and pulling into terraform
- Automating the creation of VPCs/networking infrastructure and the media endpoint