Hyderabad, Telangana, India
Healthcare & Life Sciences
Full-Time
UST
Overview
Role Description
Key Responsibilities
- Data Strategy & Architecture Development
- Define and implement data architecture and strategy that aligns with business goals.
- Design scalable, cost-effective, and high-performance data solutions using Databricks on AWS, Azure, or GCP.
- Establish best practices for Lakehouse Architecture and Delta Lake for optimized data storage, processing, and analytics.
- Data Engineering & Integration
- Architect and build ETL/ELT pipelines using Databricks Spark, Delta Live Tables, and Databricks Workflows (see the pipeline sketch after this list).
- Optimize data ingestion from systems like Oracle Fusion Middleware, WebMethods, MuleSoft, and Informatica into Databricks.
- Ensure real-time and batch data processing with Apache Spark and Delta Lake.
- Implement data integration strategies to ensure seamless connectivity with enterprise systems such as Salesforce, SAP, ERP, and CRM.
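To make the pipeline work above concrete, here is a minimal Delta Live Tables sketch in Python. The table names, landing path, columns, and data-quality rule are hypothetical, and the `spark` session is supplied by the Databricks runtime rather than created in the script.

```python
# Minimal Delta Live Tables sketch. Table names, the S3 path, and the
# columns are illustrative placeholders; `spark` is provided by Databricks.
import dlt
from pyspark.sql.functions import col

@dlt.table(comment="Raw orders ingested incrementally with Auto Loader.")
def raw_orders():
    # Auto Loader ("cloudFiles") picks up new files as they arrive.
    return (
        spark.readStream.format("cloudFiles")
        .option("cloudFiles.format", "json")
        .load("s3://example-bucket/orders/")  # hypothetical landing path
    )

@dlt.table(comment="Cleaned orders ready for analytics.")
@dlt.expect_or_drop("valid_amount", "amount > 0")  # data-quality expectation
def clean_orders():
    return dlt.read_stream("raw_orders").select(
        "order_id", "amount", col("ts").cast("timestamp")
    )
```

The same two-table pattern handles both streaming and batch sources, which is why DLT is a natural fit for the real-time and batch processing responsibilities listed above.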
- Data Governance, Security & Compliance
- Implement data governance frameworks using Unity Catalog for data lineage, metadata management, and access control.
- Ensure compliance with industry regulations like HIPAA, GDPR, and others in the life sciences domain.
- Define and enforce Role-Based Access Control (RBAC) and data security best practices using Databricks SQL and access policies (see the grants sketch after this list).
- Enable data stewardship and ensure effective data cataloging for self-service data democratization.
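As an illustration of Unity Catalog access control, here is a minimal sketch that issues grants through a Databricks Spark session; the catalog, schema, table, and group names are hypothetical examples, not part of the role description.

```python
# Minimal Unity Catalog access-control sketch, issued through a Spark
# session on Databricks. Catalog, schema, and group names are hypothetical.
statements = [
    # Allow the analysts group to browse and query a curated schema.
    "GRANT USE CATALOG ON CATALOG clinical TO `analysts`",
    "GRANT USE SCHEMA ON SCHEMA clinical.curated TO `analysts`",
    "GRANT SELECT ON TABLE clinical.curated.trial_results TO `analysts`",
    # Engineers additionally get write access for pipeline jobs.
    "GRANT MODIFY ON TABLE clinical.curated.trial_results TO `data_engineers`",
]
for stmt in statements:
    spark.sql(stmt)
```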
- Performance Optimization & Cost Management
- Optimize Databricks compute clusters (DBU usage) for cost efficiency and performance.
- Implement query optimization techniques using Photon Engine, Adaptive Query Execution (AQE), and caching strategies (see the configuration sketch after this list).
- Monitor Databricks workspace health, job performance, and cost analytics.
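A minimal sketch of the optimization levers above, assuming an existing Databricks Spark session; note that the Photon Engine is enabled at the cluster level rather than in code, so the sketch covers AQE and caching only. The table name is hypothetical.

```python
# Adaptive Query Execution re-plans joins and partition counts at runtime.
spark.conf.set("spark.sql.adaptive.enabled", "true")
spark.conf.set("spark.sql.adaptive.coalescePartitions.enabled", "true")
spark.conf.set("spark.sql.adaptive.skewJoin.enabled", "true")

# Cache a hot dimension table so repeated BI queries avoid re-reading
# cloud storage. The table name is a hypothetical example.
dim = spark.table("clinical.curated.dim_site")
dim.cache()
dim.count()  # materialize the cache eagerly
```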
- AI/ML Enablement & Advanced Analytics
- Design and support ML pipelines leveraging Databricks MLflow for model tracking and deployment (see the tracking sketch after this list).
- Enable AI-driven analytics in genomics, drug discovery, and clinical data processing.
- Collaborate with data scientists to operationalize AI/ML models in Databricks.
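The following is a minimal MLflow tracking sketch of the kind referenced above; the experiment path, model, and metric are hypothetical stand-ins, and on Databricks the MLflow tracking server is preconfigured.

```python
# Minimal MLflow tracking sketch: train a toy model, log its parameters,
# metric, and artifact so it can later be registered and deployed.
import mlflow
import mlflow.sklearn
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

X, y = make_classification(n_samples=200, n_features=5, random_state=0)

mlflow.set_experiment("/Shared/clinical-risk-demo")  # hypothetical path
with mlflow.start_run():
    model = LogisticRegression(max_iter=200).fit(X, y)
    mlflow.log_param("max_iter", 200)
    mlflow.log_metric("train_accuracy", model.score(X, y))
    # Log the fitted model for later registration and deployment.
    mlflow.sklearn.log_model(model, "model")
```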
- Collaboration & Stakeholder Alignment
- Work closely with business teams, data engineers, AI/ML teams, and IT leadership to align data strategy with enterprise goals.
- Collaborate with platform vendors (Databricks, AWS, Azure, GCP, Informatica, Oracle, MuleSoft) for solution architecture and support.
- Provide technical leadership, conduct Proofs of Concept (PoCs), and drive Databricks adoption across the organization.
- Data Democratization & Self-Service Enablement
- Implement data sharing frameworks for self-service analytics using Databricks SQL and BI tools (Power BI, Tableau).
- Promote data literacy and empower business users with self-service analytics.
- Establish data lineage and cataloging to improve data discoverability and governance.
- Migration & Modernization
- Lead the migration of legacy data platforms (e.g., Informatica, Oracle, Hadoop) to the Databricks Lakehouse.
- Design a roadmap for cloud modernization and ensure seamless data transition with minimal disruption.
Required Skills & Experience
- Databricks & Spark Expertise
- Strong knowledge of Databricks Lakehouse architecture (Delta Lake, Unity Catalog, Photon Engine).
- Expertise in Apache Spark (PySpark, Scala, SQL) for large-scale data processing.
- Experience with Databricks SQL and Delta Live Tables (DLT) for real-time and batch processing.
- Proficiency with Databricks Workflows, Job Clusters, and Task Orchestration.
- Cloud & Infrastructure Knowledge
- Hands-on experience with Databricks on AWS, Azure, or GCP (AWS Databricks preferred).
- Strong understanding of cloud storage (ADLS, S3, GCS) and cloud networking (VPC, IAM, Private Link).
- Experience with Infrastructure as Code (Terraform, ARM, CloudFormation) for Databricks setup.
- Data Modeling & Architecture
- Expertise in data modeling (Dimensional, Star Schema, Snowflake Schema, Data Vault).
- Experience with Lakehouse, Data Mesh, and Data Fabric architectures.
- Knowledge of data partitioning, indexing, caching, and query optimization techniques.
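A minimal sketch of the partitioning and query-optimization techniques just listed, applied to a Delta Lake table on Databricks; table and column names are hypothetical.

```python
# Partition a Delta table on a low-cardinality date key for coarse pruning,
# then cluster files by a high-cardinality filter column with Z-ordering.
# Table and column names are hypothetical.
(
    spark.table("raw.events")
    .write.format("delta")
    .partitionBy("event_date")
    .saveAsTable("curated.events")
)
# OPTIMIZE ... ZORDER BY co-locates related rows to speed selective queries.
spark.sql("OPTIMIZE curated.events ZORDER BY (patient_id)")
```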
- ETL/ELT & Data Integration
- Experience designing scalable ETL/ELT pipelines using Databricks, Informatica, MuleSoft, or Apache NiFi.
- Strong knowledge of batch and streaming ingestion (Kafka, Kinesis, Event Hubs, Auto Loader).
- Expertise in Delta Lake & Change Data Capture (CDC) for real-time updates.
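As a sketch of the CDC pattern above, the following applies a batch of change records to a Delta table with MERGE; the table, column, and operation-flag names are hypothetical.

```python
# Minimal change-data-capture sketch: upsert a batch of CDC records into a
# Delta table. Table and column names are hypothetical placeholders.
from delta.tables import DeltaTable

target = DeltaTable.forName(spark, "curated.customers")
updates = spark.table("staging.customer_changes")  # latest CDC batch

(
    target.alias("t")
    .merge(updates.alias("s"), "t.customer_id = s.customer_id")
    .whenMatchedDelete(condition="s.op = 'DELETE'")  # apply source deletes
    .whenMatchedUpdateAll()                          # apply updates
    .whenNotMatchedInsertAll()                       # apply inserts
    .execute()
)
```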
- Data Governance & Security
- Deep understanding of Unity Catalog, RBAC, and attribute-based access control (ABAC) for data access control.
- Experience with data lineage, metadata management, and compliance (HIPAA, GDPR, SOC 2).
- Strong skills in data encryption, data masking, and RBAC enforcement.
- Performance Optimization & Cost Management
- Ability to optimize Databricks clusters (DBU usage, Auto Scaling, Photon Engine) for cost efficiency.
- Knowledge of query tuning, caching, and performance profiling techniques.
- Experience in monitoring Databricks job performance using tools like Ganglia, CloudWatch, or Azure Monitor.
- AI/ML & Advanced Analytics (Preferred)
- Experience integrating Databricks MLflow for model tracking and deployment.
- Knowledge of AI-driven analytics, particularly in genomics, drug discovery, and life sciences data processing.
Skills
- Data Architecture
- Databricks
- Apache Spark
- AI/ML
- Cloud Platforms (AWS, Azure, GCP)
- Data Governance & Security
- ETL/ELT & Data Integration
- Performance Optimization
- Data Modeling