Platform Summary
Cloud-native anime data platform implementing a medallion architecture on AWS to ingest, transform, and serve analytics-ready datasets for APIs, dashboards, and ML workflows.
Coursework Detail Page
Anime Harbor
A medallion-architecture data platform that ingests, transforms, and serves anime analytics datasets on AWS.
Cloud-native anime data platform implementing a medallion architecture on AWS to ingest, transform, and serve analytics-ready datasets for APIs, dashboards, and ML workflows.
Bronze
Raw Data Layer
Unprocessed API and dataset payloads in JSON/CSV on S3, partitioned by source and ingest date.
Silver
Cleaned Data Layer
Glue ETL standardizes schema, removes duplicates, validates records, and writes partitioned Parquet.
Gold
Serving Layer
Analytics-ready aggregates such as top-rated anime, trending titles, and genre-level insights.
| Area | Tools |
|---|---|
| Storage | Amazon S3 (bronze/silver/gold prefixes) |
| ETL + Catalog | AWS Glue jobs, crawlers, Data Catalog, PySpark |
| Orchestration | EventBridge schedules, optional Lambda triggers |
| Serving | Athena queries + Node.js/Express REST API |
| Reliability | Schema checks, null/duplicate validation, CloudWatch logging |
/ingestion, /etl, /models, /api.year/month/day for scale.