We are seeking a highly skilled Senior Data Engineer to join our team. The ideal candidate will be responsible for designing and implementing data architectures that support our business goals. This is a critical role that requires a strong understanding of data processing, data modeling, and data storage.
Duties:
- Design and implement data pipelines using Apache Hive, Talend, and other relevant tools.
- Develop and maintain data warehouses to store and manage large datasets.
- Create and manage data models to support business intelligence and analytics applications.
- Develop and deploy RESTful APIs to integrate with various data sources and applications.
- Design and implement data quality control processes to ensure data accuracy and integrity.
- Collaborate with data scientists to develop and train machine learning models using linked data and other relevant tools.
- Monitor and troubleshoot data pipelines and applications using tools such as watch and smash.
- Provide technical support and training to data analysts and other stakeholders.
- Stay up-to-date with the latest developments in data engineering and apply new technologies and techniques as needed.
Experience:
- Bachelor's degree in Computer Science, Data Science, or a related field.
- 7+ years of experience in data engineering, data science, or a related field.
- Strong experience with data warehouse management and SQL programming.
- Experience with data modeling and data quality control processes.
- Experience with RESTful API design and development.
- Experience with machine learning model training and linked data.
- Experience with Apache Hive and Talend.
- Experience with data pipeline management and data integration.
- Excellent analytical and problem-solving skills.
- Strong communication and collaboration skills.
- Experience with data visualization tools and techniques.
- Experience with cloud-based data engineering platforms and tools.
Skills:
- Data warehouse management
- RESTful API design and development
- Model training
- SQL programming
- Linked data
- Databricks
- Python
- Apache Hive
- Smash