Internal Amazon project for comprehensive analysis of git repositories—tracking dependencies, code changes, and git metrics across thousands of npm monorepos, websites, and GenAI/AgenticAI projects.
An internal TypeScript application at Amazon for analyzing repositories across Amazon's internal git server. The system continuously tracks ~2,000 repositories, ~900 CI/CD pipelines, and ~600 internal websites—surfacing dependency graphs, code churn, contributor activity, and git metrics at scale.
The platform crawls Amazon's internal git infrastructure to identify and classify repositories by type (npm monorepos, frontends, Jupyter notebooks, GenAI/AgenticAI projects) and runs analysis jobs against each one. Teams use it to answer questions like: which repos depend on this internal package?, how active is this codebase?, and who are the top contributors across this org?
Job processing is handled by BullMQ workers backed by Redis, with results stored in DynamoDB. The frontend is a React SPA using tRPC for type-safe data fetching, Material UI with @emotion for styling, and React Query for cache management. Infrastructure is defined with AWS CDK and the monorepo is managed with NX.
CloudWatch RUM provides real-user performance metrics for tracking adoption and catching regressions.
Locked behind Amazon's internal VPN — no public demo available.
Working on this sparked a personal project. The experience of building repository analysis tooling inspired SaaS Recipes—a separate, independently built collection of curated SaaS templates and starter kits with detailed tech stack breakdowns.