Sr. Data Engineer, AGI-DS DE
Amazon
Description
Want to help build the next generation of intelligent assistant products? Join us! We are looking for a talented Sr. Data Engineer to help us build the data pipelines and analytics platforms needed to achieve our vision.
What will you do? In this role, you will be responsible for implementing modern, responsible, and innovative data experiences for a breadth of customer-facing solutions across multiple programs. You will work with scientist customers as we collect, clean, and evaluate data used for AI model training.
Key job responsibilities
· Design, implement, and support a platform providing ad hoc access to large datasets
· Interface with other technology teams to extract, transform, and load data from a wide variety of data sources using SQL
· Implement data structures using best practices in data modeling, ETL/ELT processes, and SQL, Oracle, Redshift, and OLAP technologies
· Model data and metadata for ad hoc and pre-built reporting
· Interface with business customers, gathering requirements and delivering complete reporting solutions
· Build robust and scalable data integration (ETL) pipelines using SQL, Python, and Spark
· Build and deliver high-quality datasets to support business analyst and customer reporting needs
· Continually improve ongoing reporting and analysis processes, automating or simplifying self-service support for customers
· Participate in strategic & tactical planning discussions, including annual budget processes
A day in the life
Your day will be a mix of helping the team design systems for building and managing training data, working with the team to deep dive into data quality issues, working with stakeholders to understand their requirements, and collaborating with sister teams.
About the team
AGI Data Services (AGI-DS) is part of the Artificial General Intelligence (AGI) organization that builds services for storing, accessing, and manually labeling data for training foundational and specialized AI models with world-class privacy and security for our end customers.
Basic Qualifications
– 5 years of data engineering experience
– Experience with data modeling, warehousing and building ETL pipelines
– Experience with SQL
– Experience in at least one modern scripting or programming language, such as Python, Java, Scala, or NodeJS
– Experience mentoring team members on best practices
Preferred Qualifications
– Experience with big data technologies such as Hadoop, Hive, Spark, and EMR
– Experience operating large data warehouses
Amazon is committed to a diverse and inclusive workplace. Amazon is an equal opportunity employer and does not discriminate on the basis of race, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other legally protected status. For individuals with disabilities who would like to request an accommodation, please visit https://www.amazon.jobs/en/disability/us.
Our compensation reflects the cost of labor across several US geographic markets. The base pay for this position ranges from $139,100/year in our lowest geographic market up to $240,500/year in our highest geographic market. Pay is based on a number of factors including market location and may vary depending on job-related knowledge, skills, and experience. Amazon is a total compensation company. Dependent on the position offered, equity, sign-on payments, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits. For more information, please visit https://www.aboutamazon.com/workplace/employee-benefits. This position will remain posted until filled. Applicants should apply via our internal or external career site.