CMS Claims Comparison Pipeline

Documentation hub for the data comparison pipeline. Browse reports, interactive tools, design documentation, and reference materials.

Reports & Analysis
📊 HTML
Comparison Report
Full analysis report with interactive charts, discrepancy dashboard, validation results, and financial reconciliation.

Interactive Tools
🏗 HTML
Architecture Diagrams
Interactive Mermaid.js diagrams showing the 6-step pipeline, data flow, cloud deployment, and technology stack.
🗃 HTML
Schema Explorer
Drag-and-drop ERD viewer for all 8 DuckDB tables. Click tables for column details, zoom, pan, and search.
📂 HTML
Parquet Viewer
In-browser Parquet file viewer powered by hyparquet. Drag and drop any .parquet file to explore schema and data.
🐿 HTML
SQL Explorer
Run SQL queries on Parquet files in your browser. Powered by the Squirreling async SQL engine and hyparquet; no server needed.

Reviewer Guide
📜 HTML
Reviewer Walkthrough
Guided tour: what to look at, in what order, and why. Includes screenshots, a code reading order, and a mapping to the assessment requirements.
💬 HTML
Assessment Feedback
Approach rationale, skills demonstrated, design trade-offs, and reflections on the assessment.
HTML
Requirements Traceability
Maps each assessment requirement to its implementation with evidence and file locations.

Design Documentation
🛠 HTML
Solution Design
Architecture decisions, design rationale for DuckDB, the pipeline, and Docker, plus analysis findings and deployment strategy.
HTML
Pipeline Reference
Complete reference for the pipeline: all 6 steps, the data model, 129 validation and comparison checks, and output artifacts.
📚 HTML
Data Dictionary
Dataset overview, column definitions for all tables, old/new system schemas, and codebook reference.
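
As a rough illustration of the kind of comparison check the Pipeline Reference describes, the sketch below reconciles claims between an old and a new system. It is not the pipeline's actual code: the table names (`old_claims`, `new_claims`), the columns (`claim_id`, `paid_cents`), and the sample rows are all hypothetical, and Python's built-in `sqlite3` module stands in for DuckDB so the example runs without extra dependencies.

```python
import sqlite3


def compare_claims(conn):
    """Return (missing_in_new, amount_mismatches) for two hypothetical tables."""
    # Claims present in the old system but absent from the new one.
    missing = [row[0] for row in conn.execute(
        """
        SELECT o.claim_id
        FROM old_claims AS o
        LEFT JOIN new_claims AS n ON n.claim_id = o.claim_id
        WHERE n.claim_id IS NULL
        ORDER BY o.claim_id
        """
    )]
    # Claims present in both systems whose paid amounts differ.
    mismatched = conn.execute(
        """
        SELECT o.claim_id, o.paid_cents, n.paid_cents
        FROM old_claims AS o
        JOIN new_claims AS n ON n.claim_id = o.claim_id
        WHERE o.paid_cents <> n.paid_cents
        ORDER BY o.claim_id
        """
    ).fetchall()
    return missing, mismatched


# Hypothetical sample data: C3 is dropped by the new system, C2 changes amount.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE old_claims (claim_id TEXT PRIMARY KEY, paid_cents INTEGER);
    CREATE TABLE new_claims (claim_id TEXT PRIMARY KEY, paid_cents INTEGER);
    INSERT INTO old_claims VALUES ('C1', 10000), ('C2', 25050), ('C3', 7500);
    INSERT INTO new_claims VALUES ('C1', 10000), ('C2', 25000);
""")

missing, mismatched = compare_claims(conn)
print("missing in new:", missing)
print("amount mismatches:", mismatched)
```

Amounts are stored as integer cents here so that the equality comparison is exact; a check on floating-point dollar amounts would typically need a tolerance instead.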

Reference Materials
📋 PDF
DE-SynPUF Codebook
Official CMS codebook with field definitions, value sets, and data dictionary for all DE-SynPUF files.
PDF
Frequently Asked Questions
CMS FAQ covering data limitations, known issues, and guidance for working with synthetic claims data.
📑 PDF
Data Users Guide
Comprehensive guide to the DE-SynPUF synthetic data: methodology, file layouts, and usage notes.