Data Engineering Internship

  • Red Hat, Inc.
  • Boston, MA, USA
  • Oct 07, 2019
Internship Aerospace engineering Astronomy Biology Computer Science Chemical engineering Engineering Health Science Life Science Mathematics Medical Sciences Chemistry Civil engineering Physics Psychology Social Science Teaching/Academics Technology Veterinary medicine


At Red Hat, we connect an innovative community of customers, partners, and contributors to deliver an open source stack of trusted, high-performing solutions. We offer cloud, Linux, middleware, storage, and virtualization technologies, together with award-winning global customer support, consulting, and implementation services. Red Hat is a rapidly growing company supporting more than 90% of Fortune 500 companies.

Job summary

The Red Hat Artificial Intelligence (AI) Center of Excellence (CoE) team is looking for a Data Engineering Intern to join us in Boston, MA. In this role, you will contribute to our understanding and modeling of data related to product logs, product stacks, and product quality data. As a subscription-based company, Red Hat uses data to identify factors that promote quality and optimize packaging and distribution of our offerings. Using Elasticsearch, Logstash, Apache NiFi, and Goblin, you will process large amounts of data in batch and real-time for use in these analyses.

Primary job responsibilities

  • Analyze structured and unstructured data sources to integrate them into a common data store
  • Create visualizations as a means of communicating insights into data
  • Contribute to data distribution techniques to optimize storage for data analysis
  • Evaluate and advise on data platform tools and technologies for inclusion in project work
  • Prototype ideas and communicate results
  • Collaborate with other developers and analysts across teams regarding data ingestion and availability

Required skills

  • Ability to work full-time hours during summer 2020 in the location listed
  • Familiarity with Elasticsearch, Amazon S3, and Logstash
  • Data manipulation experience using Python, Apache NiFi, or other language
  • Ability to communicate key insights and findings to business stakeholders
  • Knowledge of tracing and debugging source code
  • Ability to work with a distributed team
  • Experience with computer science, data science, or other relevant applied mathematics discipline

Red Hat is proud to be an equal opportunity workplace and an affirmative action employer. We review applications for employment without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, ancestry, citizenship, age, uniformed services, genetic information, physical or mental disability, medical condition, marital status, or any other basis prohibited by law.

Red Hat does not seek or accept unsolicited resumes or CVs from recruitment agencies. We are not responsible for, and will not pay, any fees, commissions, or any other payment related to unsolicited resumes or CVs except as required in a written contract between Red Hat and the recruitment agency or party requesting payment of a fee.


Red Hat supports individuals with disabilities and provides reasonable accommodations to job applicants. If you need assistance completing our online job application, email General inquiries, such as those regarding the status of a job application, will not receive a reply.

Experience level of the applicant we want

Some work experience, Graduate