Overview
We are seeking Cloud Data Engineer Intern, who will be part of the Engineering team and collaborating with software development, quality assurance, and IT operations teams to build and maintain systems that collect, manage, and convert raw data into information that can be used by business analysts and data scientists.
This role requires a engineer who is passionate about working with large amount of data & analytics. We are a small team of highly skilled engineers and looking forward to adding a new member who wishes to advance in one's career by continuous learning. Selected candidates will be an integral part of a team of passionate and enthusiastic IT professionals, and have tremendous opportunities to contribute to the success of the products that we build.
What you will do
- Ideal candidate will be responsible for designing, building and maintaining data solutions and workflows in the Cloud
- Develops and maintains scalable data pipelines and builds out new API integrations to support continuing increases in data volume and complexity.
- Collaborates with analytics and business teams to improve data models that feed business intelligence tools, increasing data accessibility and fostering data-driven decision making across the organization.
- Implements processes and systems to monitor data quality, ensuring production data is always accurate and available for key stakeholders and business processes that depend on it.
- Writes unit/integration tests, contributes to engineering wiki, and documents work.
- Performs data analysis required to troubleshoot data related issues and assist in the resolution of data issues.
- Works closely with a team of frontend and backend engineers, product managers, and analysts.
- Engineer solutions using LLMs, Python, SQL
- Resolving data problems across multiple application domains and platforms using system troubleshooting and problem-solving techniques
- Collaborate with development, QA, and operations teams to design and implement data pipelines.
- Defines company data assets (data models), and jobs to populate data models.
- Designs data integrations and data quality framework.
- Designs and evaluates open source and vendor tools for data lineage.
- Works closely with all business units and engineering teams to develop strategy for long term data platform architecture.
- Promotes knowledge sharing activities within and across different product teams by creating and engaging in communities of practice and through documentation, training, and mentoring
- Keep skills up to date through ongoing self-directed training
What skills are required
- Ability to learn new technologies quickly.
- Ability to work both independently and in collaborative teams to communicate design and build ideas effectively.
- Problem-solving, and critical-thinking skills including ability to organize, analyze, interpret, and disseminate information.
- Excellent spoken and written communication skills
- Must be able to work as part of a diverse team, as well as independently
- Ability to follow departmental and organizational processes and meet established goals and deadlines
- Experience with LLMs, PyTorch, TensorFlow. Prompt Engineering will be a plus
- Working Knowledge in Java, Xml, Json, SQL
- Knowledge of scripting and automation using Python, Bash, Perl to automate AWS tasks
- Bachelor's degree in Engineering or Masters degree in computer science.
Note : Candidates who have passed out in the year 2023 or 2024 can only apply for this Internship.
This is Internship to Hire position and Candidates who complete the internship will be offered full-time position based on performance
Job Types: Full-time, Permanent, Fresher, Internship
Contract length: 6 months
Pay: ₹5,500.00 - ₹7,000.00 per month
Schedule:
- Day shift
- Monday to Friday
- Morning shift
Expected Start Date: 01/07/2025