Data Pipeline Engineer (ML) Job at OSI Engineering, Seattle, WA

  • OSI Engineering
  • Seattle, WA

Job Description

Our client is scaling production ML systems and needs a hands-on engineer to help build, maintain, and run essential ML data pipelines. You’ll own high-throughput data ingestion and transformation workflows (including image- and array-type modalities), enforce rigorous data quality standards, and partner with research and platform teams to keep models fed with reliable, versioned datasets.

  • Design, build, and operate reliable ML data pipelines for batch and/or streaming use cases across cloud environments.
  • Develop robust ETL/ELT processes (ingest, validate, cleanse, transform, and publish) with clear SLAs and monitoring.
  • Implement data quality gates (schema checks, null/outlier handling, drift and bias signals) and data versioning for reproducibility.
  • Optimize pipelines for distributed computing and large modalities (e.g., images, multi-dimensional arrays).
  • Automate repetitive workflows with CI/CD and infrastructure-as-code; document, test, and harden for production.
  • Collaborate with ML, Data Science, and Platform teams to align datasets, features, and model training needs.

Minimum Qualifications:

  • 5+ years building and operating data pipelines in production.
  • Cloud: Hands-on with AWS, Azure, or GCP services for storage, compute, orchestration, and security.
  • Programming: Strong proficiency in Python and common data/ML libraries (pandas, NumPy, etc.).
  • Distributed compute: Experience with at least one of Spark, Dask, or Ray.
  • Modalities: Experience handling image-type and array-type data at scale.
  • Automation: Proven ability to automate repetitive tasks (shell/Python scripting, CI/CD).
  • Data Quality: Implemented validation, cleansing, and transformation frameworks in production.
  • Data Versioning: Familiar with tools/practices such as DVC, LakeFS, or similar.
  • Languages: Fluent in English or Farsi.

Strongly Preferred:

  • SQL expertise (writing performant queries; optimizing on large datasets).
  • Data warehousing/lakehouse concepts and tools (e.g., Snowflake/BigQuery/Redshift; Delta/Lakehouse patterns).
  • Data virtualization/federation exposure (e.g., Presto/Trino) and semantic/metadata layers.
  • Orchestration (Airflow, Dagster, Prefect) and observability/monitoring for data pipelines.
  • MLOps practices (feature stores, experiment tracking, lineage, artifacts).
  • Containers & IaC (Docker; Terraform/CloudFormation) and CI/CD for data/ML workflows.
  • Testing for data/ETL (unit/integration tests, great_expectations or similar).

Soft Skills:

  • Executes independently and creatively; comfortable owning outcomes in ambiguous environments.
  • Proactive communicator who collaborates cross-functionally with DS/ML/Platform stakeholders.

Location: Seattle, WA

Duration: 1+ year

Pay: $56/hr
