rudiejd/data-ingestion

 
 


This is an app that hosts all our data-crunching jobs, especially those that use S3.

It uses Chalice, which makes it easy to add new functions that can be triggered on an interval or via a REST API endpoint.

So far we have:

  • Store MBTA Alerts data daily.
  • Store the number of trips with new trains on the Orange and Red Lines daily.
  • Store Bluebikes station status data every 5 minutes.

To add a new Lambda function, put the methods you need in a new file in chalicelib/, then add your trigger in app.py. Next, add a policy.json file in .chalice/ for any permissions the function needs, and update .chalice/config.json accordingly. Lastly, if you need secret environment variables, update resources.json and make sure they are passed with --parameter-overrides in the cloudformation deploy step of deploy.sh. (Non-secret environment variables can be added to config.json directly.)
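To make the policy and config step more concrete, the sketch below builds the kind of JSON that might go into a policy file and into the matching config.json entry, and prints both. The function name, policy file name, bucket, stage name, and environment variable are all hypothetical, and the exact layout of this repository's config.json may differ. The keys `autogen_policy`, `iam_policy_file`, `environment_variables`, and `lambda_functions` are Chalice configuration options.

```python
import json

# Hypothetical names, used only for illustration.
FUNCTION_NAME = "store_example_data"        # handler name in app.py
POLICY_FILE = "policy-store-example.json"   # file placed in .chalice/
BUCKET = "example-bucket"


def build_policy(bucket):
    """IAM policy that allows writing logs and putting objects in one bucket."""
    return {
        "Version": "2012-10-17",
        "Statement": [
            {
                # With a hand-written policy, logging permissions
                # must be granted explicitly.
                "Effect": "Allow",
                "Action": [
                    "logs:CreateLogGroup",
                    "logs:CreateLogStream",
                    "logs:PutLogEvents",
                ],
                "Resource": "arn:aws:logs:*:*:*",
            },
            {
                "Effect": "Allow",
                "Action": ["s3:PutObject"],
                "Resource": f"arn:aws:s3:::{bucket}/*",
            },
        ],
    }


def build_function_config(policy_file, env):
    """Per-function config.json entry that points at a custom policy file."""
    return {
        "autogen_policy": False,
        "iam_policy_file": policy_file,
        "environment_variables": dict(env),  # non-secret values only
    }


if __name__ == "__main__":
    # Contents for .chalice/policy-store-example.json
    print(json.dumps(build_policy(BUCKET), indent=2))

    # Fragment for .chalice/config.json, assuming a stage named "prod"
    stage = {
        "lambda_functions": {
            FUNCTION_NAME: build_function_config(
                POLICY_FILE, {"EXAMPLE_BUCKET": BUCKET}
            )
        }
    }
    print(json.dumps({"stages": {"prod": stage}}, indent=2))
```

Secret values are deliberately left out of this sketch, since those go through resources.json and the deploy step instead.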

About

Crontab for data ingestion/processing on AWS Lambda
