Engineering · Company Building Agents San Francisco or New York · SF preferred

Software Engineer, Data Infrastructure

You will build the data backbone for EICO.

Our models, evals, and incubation teams depend on clean traces of real company-building work: what agents tried, what customers did, what was delivered, what converted, and what was verified. You’ll turn that raw activity into trustworthy datasets for training, post-training, evaluation, and product.

What you’ll own:

01Batch and streaming pipelines for training, post-training, evaluation, and product data.
02Storage, processing, lineage, deduplication, quality checks, and dataset versioning for company-building traces.
03Data systems that connect agent activity to outcomes: demand, customers, delivery, revenue, and verification.
04Tools researchers and engineers use to query, inspect, debug, and trust the data.

What we’re looking for:

Experience building large-scale data infrastructure in production.
Strong Python and SQL. Spark, Ray, Kafka, Flink, Airflow, dbt, or similar experience preferred.
Strong instincts for data quality, schema design, performance, and cost.
Has built systems where correctness, lineage, and reliability mattered.
Can turn messy events and traces into clean datasets without losing important context.
Comfortable working across infra, research, product, and operations.
Notices when something is off before anyone else does.

To apply, email careers@eico.so with a few lines about you and the most impressive thing you’ve built.

← All roles