Software Engineer, Data Infrastructure
You will build the data backbone for EICO.
Our models, evals, and incubation teams depend on clean traces of real company-building work: what agents tried, what customers did, what was delivered, what converted, and what was verified. You’ll turn that raw activity into trustworthy datasets for training, post-training, evaluation, and product.
What you’ll own:
- 01Batch and streaming pipelines for training, post-training, evaluation, and product data.
- 02Storage, processing, lineage, deduplication, quality checks, and dataset versioning for company-building traces.
- 03Data systems that connect agent activity to outcomes: demand, customers, delivery, revenue, and verification.
- 04Tools researchers and engineers use to query, inspect, debug, and trust the data.
What we’re looking for:
- Experience building large-scale data infrastructure in production.
- Strong Python and SQL. Spark, Ray, Kafka, Flink, Airflow, dbt, or similar experience preferred.
- Strong instincts for data quality, schema design, performance, and cost.
- Has built systems where correctness, lineage, and reliability mattered.
- Can turn messy events and traces into clean datasets without losing important context.
- Comfortable working across infra, research, product, and operations.
- Notices when something is off before anyone else does.
To apply, email careers@eico.so with a few lines about you and the most impressive thing you’ve built.