SignalLake
Backend, data, and platform engineers need fast answers about service latency, failures, and slow endpoints without always requiring a warehouse or managed observability stack.
I designed and built an end-to-end log analytics platform that validates incoming events, preserves raw records in JSONL, transforms them into Hive-partitioned Parquet, and serves operational metrics through DuckDB and FastAPI. On a benchmark of 1 million events across 192 files, partition pruning reduced data scanned from 86.7 MB to 6.3 MB.
1M events · 192 files