Data Engineer
Location
Gurgaon, India
Department
Digital BU / DICI
About Us
Zelestra (formerly known as Solarpack) is a purpose-driven global renewables company specializing in the development, construction, commercialization, and operation of large-scale renewable projects, with a presence in fast-growing markets in India, Europe, North America, Latin America, Asia, and Africa.
Since its founding in 2005, the company has developed or built, on a turnkey or EPC basis, power plants representing a total capacity of 2.5 GW. It has a portfolio of 17 GW of projects across 14 countries, with 2 GW signed with customers and 1.3 GW in operation or under construction, in the United States, Europe, Latin America, and India.
Headquartered in Spain, Zelestra has over 700 professionals and is expected to grow to over 1,000 by the end of 2024.
Zelestra is backed by EQT, one of the three largest funds in the world, with $232B in assets.
Our purpose is to accelerate the transition to clean and affordable energy for all. Our values are integral to our mission of leading the renewable energy charge, ensuring that we continue to set industry standards. We maintain a firm commitment to contribute directly to the social development of the communities and markets in which we operate, not only through the creation of economic value, but also through the generation of quality employment and through the social projects we promote.
Mission
We are seeking a skilled and experienced Data Engineer to join our team. In this role, you will be responsible for designing and managing robust data pipelines, building and maintaining databases, and deploying cloud infrastructure on AWS and Microsoft Azure. You will collaborate closely with our product manager and data scientist to ensure seamless data ingestion, model deployment, and system optimization. If you have a strong background in cloud technologies and data engineering, and a passion for renewable energy, we’d love to hear from you.
Responsibilities
- Database Development & Management: Design, build, and maintain databases to store and manage data from solar plants and wind farms. Fetch and organize real-time data from plants using OPC client protocols and ensure its efficient storage. Maintain version control for models and data pipelines using Git and GitHub.
- Data Pipeline Construction: Develop and manage data pipelines to ingest real-time data from various sources. Ensure the data pipelines are scalable, robust, and efficient, meeting the demands of large-scale data ingestion and processing.
- Cloud Infrastructure Management: Deploy and manage the necessary cloud infrastructure on AWS and Microsoft Azure. Implement best practices for cloud resource optimization, including cost management, security, and scalability. First-hand experience with services such as Glue, Athena, S3, and QuickSight on AWS, or Data Factory, Synapse Analytics, Blob Storage, and Power BI on Azure, is expected.
- Model Deployment & Support: Collaborate with data scientists to deploy machine learning models into production environments. Ensure models are integrated into data pipelines and are efficiently consuming and processing data.
- Monitoring & Maintenance: Monitor data fetching pipelines and cloud resources to ensure continuous operation and address any issues that arise. Implement automated monitoring solutions to detect and respond to system anomalies.
- Cross-Team Collaboration: Work closely with the product manager and data scientist to align engineering efforts with product goals. Provide technical insights and recommendations for future product development.
- Documentation & Reporting: Create detailed documentation for all processes, pipelines, and infrastructure configurations. Report on system performance and provide insights for optimization.
Job Requirements
- Experience: 3-5 years of experience as a software engineer or in a similar role, with a focus on cloud technologies and data engineering.
- Technical Skills: Proficiency in Python for building and maintaining data pipelines and working with data processing frameworks. Hands-on experience with cloud platforms, specifically AWS and Microsoft Azure. Experience with database design and management, including SQL and NoSQL databases. Familiarity with OPC client protocols and their application in industrial data fetching. Experience with Git and version control systems for collaborative work.
- Cloud & Infrastructure: Strong knowledge of cloud services such as AWS EC2, S3, Lambda, and RDS, and Azure VMs, Azure Blob Storage, and Azure Functions. Experience in setting up and managing cloud-based data lakes and data warehouses.
- Data Engineering: Proven ability to build and manage data pipelines for real-time data ingestion and processing. Experience with monitoring and maintaining large-scale data systems.
- Collaboration & Communication: Excellent communication skills with the ability to work collaboratively in a cross-functional team environment. Ability to articulate technical concepts to non-technical stakeholders.
Preferred Qualifications
- Renewable Energy Experience: Experience working with data from renewable energy sources, specifically solar plants and wind farms.
- OPC Client Expertise: In-depth knowledge of OPC protocols and experience in integrating them with cloud-based systems.
- Certification: Certifications in AWS or Microsoft Azure cloud platforms. Data engineering or cloud architecture certifications would be an added advantage.
- Additional Skills: Experience with containerization technologies such as Docker and Kubernetes. Familiarity with DevOps practices and CI/CD pipelines.
Zelestra is an equal opportunity employer. We encourage applications from candidates of all backgrounds and experiences. If you are passionate about the intersection of data and renewable energy and want to be part of a team dedicated to making a positive impact, we invite you to apply.
JR1777
#LI-PO1
#LI-HYBRID