The Data Engineer, Baseball Systems will be a member of the baseball operations software development team and will be responsible for integrating, collecting, processing, and storing baseball data from many sources, as well as designing and building new data solutions. This individual must be comfortable with on-premises and cloud solutions and take the initiative to explore new optimizations and cutting-edge data technologies. The Data Engineer will work closely with our data architect, analysts, developers, and other members of baseball operations.
RESPONSIBILITIES:
Build leading-edge baseball solutions together with the software development team, analysts, and others on new and existing baseball systems
Build and maintain integration pipelines, often API-based or file-based, while identifying areas for improvement and taking time to re-architect when required
Build and maintain infrastructure to optimize the extraction, transformation, and loading of data from various sources
Design, build, and maintain data warehousing solutions for the software development and analytics teams
Build and maintain tools for the analysts to enable more efficient and extensive data modeling and simulation efforts
Participate in key phases of the software development process of critical baseball applications, including requirements gathering, analysis, effort estimation, technical investigation, software design and implementation, testing, bug fixing, and quality assurance
Actively participate with software developers and data architects in design reviews, code reviews, and other best practices
Work closely at times with baseball analysts to design and implement data solutions
Respond to and resolve technical problems and issues in a timely manner
SKILLS & QUALIFICATIONS:
TECHNICAL SKILLS
Bachelor’s degree in Computer Science, Software Engineering, Computer Engineering, Statistics, Information Systems, or a related field
2-3 years of experience in a Data Engineer role
Proficiency with SQL, including query optimization, stored procedures, views, and other database objects
Experience building custom API integrations, interfacing with JSON, XML, and custom data structures
Experience with AWS, GCP, or Azure cloud services, such as Cloud SQL, RDS, Redshift, Azure SQL, Azure SQL DW, or others
Experience building data solutions using Python, C#, C++, Ruby, or other languages
Experience with scheduling and workflow management platforms, such as Airflow
Experience with ETL tools and pipelines from various platforms
Experience with big data frameworks such as Hadoop or Spark is a plus
Experience with R and RStudio is a plus
GENERAL SKILLS
Ability to work autonomously and as part of a team in a fast-paced environment
High level of attention to detail with the ability to multi-task effectively
Comfortable working remotely using Zoom, Teams, Slack, Trello, and other tools to communicate with all team members
High degree of professionalism and ability to maintain confidential information
Excellent organizational and time management skills
An understanding of baseball is a plus
The Red Sox (or FSM) requires proof of being up-to-date on COVID-19 vaccination as a condition of employment, subject to applicable legal requirements. Up-to-date means having received all recommended COVID-19 vaccination doses in the primary series and a booster dose(s) when eligible, per CDC guidelines.
Prospective employees will receive consideration without discrimination based on race, religious creed, color, sex, age, national origin, handicap, disability, military/veteran status, ancestry, sexual orientation, gender identity/expression or protected genetic information.