Data Engineer – Houston Astros (Houston, TX)

APPLY NOW!

The Houston Astros baseball organization is accepting applications for a Data Engineer to join our Research & Development team within Baseball Operations. We are seeking an applicant to support the growth of our data architecture using cloud-based data lake technologies. This role will work within a team of software developers supporting the broad need of Baseball Operations and will be central to the workflow of departments across the organization, including opportunities to interface with and understand the needs of other departments and drive creative solutions.

Essential Functions & Responsibilities

Reasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.

  • Collaborate with the team on the design and implementation of a cloud-based data architecture.
  • Leverage Spark-based solutions to develop and maintain data processing pipelines that provide efficient access to data at various stages of transformation.
  • Integrate structured, semi-structured, and unstructured data sources, handling various formats including Parquet, JSON, and more.
  • Automate workflows and monitoring procedures to promote a maintainable infrastructure.
  • Write clean and iterative code and leverage continuous integration practices to deploy, support and operate data pipelines.
  • Interact with stakeholders internal to R&D (research analysts, application developers, ) and external to understand their needs from our architecture and data.
  • Participate in a rotating on-call schedule to tend to any immediate issues with our architecture and data.
  • Perform other duties as assigned.