Overview
Location: Hyderabad
Experience: 5 to 8 years
Must Have
Experience as an AI Data Engineer.
Experience in DVC (Data Version Control) and Airflow.
Experience with Apache Spark, Flink, and Kafka.
Experience in advanced Python for AI logic, plus Rust (or C++) for high-performance work.
Experience with vector databases, including configuration of HNSW indexes, scalar quantization, and metadata filtering strategies.
Role Overview
We are seeking a hands-on AI Data Engineer to build the high-performance data infrastructure that powers autonomous AI agents. You won't just move data from A to B; you will architect Dynamic Context Windows, manage Real-time Semantic Indexes, and build Self-Cleaning Data Pipelines that feed our "Super Employee" agents.
Key Responsibilities
Vector & Graph ETL: Design and maintain pipelines that transform unstructured data (PDFs, emails, logs, chats) into optimized embeddings for Vector Databases (Pinecone, Weaviate, Milvus).
Semantic Data Modeling: Engineer data structures that optimize for Retrieval-Augmented Generation (RAG), ensuring agents find the "needle in the haystack" in milliseconds.
Knowledge Graph Construction: Build and scale Knowledge Graphs (Neo4j) to represent complex relationships in our trading and support data that standard vector search misses.
Automated Data Labeling & Synthetic Data: Implement pipelines using LLMs to auto-label datasets or generate synthetic edge cases for agent training and evaluation.
Stream Processing for Agents: Build real-time data "listeners" (Kafka/Flink) that feed live context to agents, allowing them to react to market or support events as they happen.
Data Reliability & "Drift" Detection: Build monitoring for embedding drift, identifying when the statistical distribution of your data changes and the agent's "knowledge" becomes stale (a minimal monitoring sketch follows this list).
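To make the drift-detection responsibility concrete, here is a minimal sketch of a centroid-based embedding-drift check, assuming embedding batches are available as NumPy arrays. The function name, batch variables, and the 0.15 alert threshold are illustrative assumptions for this posting, not an existing internal API.

```python
# Centroid-based embedding-drift check. All names and the alert threshold
# are illustrative placeholders, not an existing internal API.
import numpy as np

def drift_score(reference: np.ndarray, current: np.ndarray) -> float:
    """Cosine distance between the mean vectors of two embedding batches."""
    ref_centroid = reference.mean(axis=0)
    cur_centroid = current.mean(axis=0)
    cosine_sim = float(
        np.dot(ref_centroid, cur_centroid)
        / (np.linalg.norm(ref_centroid) * np.linalg.norm(cur_centroid))
    )
    return 1.0 - cosine_sim

if __name__ == "__main__":
    rng = np.random.default_rng(42)
    reference_batch = rng.normal(size=(1000, 768))          # e.g. last week's embeddings
    current_batch = rng.normal(loc=0.3, size=(1000, 768))   # newly ingested batch, shifted
    score = drift_score(reference_batch, current_batch)
    print(f"drift score: {score:.3f}")
    if score > 0.15:  # alert threshold chosen purely for illustration
        print("Embedding drift detected: re-embed or re-index the affected collection")
```

In practice a check like this would typically run on a schedule (Airflow/Prefect) against a frozen reference batch per collection, with alerts feeding the re-indexing workflow.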
Qualifications
Vector Database Mastery: Expert-level configuration of HNSW indexes, scalar quantization, and metadata filtering strategies within Pinecone, Milvus, or Qdrant (see the configuration sketch after this list).
Advanced Python & Rust: Proficiency in Python for AI logic and Rust (or C++) for high-performance data processing and custom embedding functions.
Big Data Ecosystem: Hands-on experience with Apache Spark, Flink, and Kafka in a high-throughput environment (Trading/FinTech preferred).
LLM Data Tooling: Deep experience with Unstructured.io, LlamaIndex, or LangChain for document parsing and chunking strategy optimization.
MLOps & DataOps: Mastery of DVC (Data Version Control) and Airflow/Prefect for managing complex, non-linear AI data workflows.
Embedding Models: Understanding of how to fine-tune embedding models (e.g., BGE, Cohere, or OpenAI) to better represent domain-specific (Trading) terminology.
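As an illustration of the index-level configuration described above, the following sketch uses Qdrant's Python client (one of the stores named in the list). The collection name, vector size, and parameter values are assumptions for the example, not a prescribed setup.

```python
# Sketch: HNSW tuning + scalar quantization + metadata filtering in Qdrant.
# Collection name, vector size, and parameter values are illustrative assumptions.
from qdrant_client import QdrantClient
from qdrant_client.models import (
    Distance, VectorParams, HnswConfigDiff,
    ScalarQuantization, ScalarQuantizationConfig, ScalarType,
    Filter, FieldCondition, MatchValue,
)

client = QdrantClient(url="http://localhost:6333")

# Tune graph connectivity (m) and build-time search depth (ef_construct),
# and keep int8-quantized vectors in RAM to cut memory use and latency.
client.create_collection(
    collection_name="trading_docs",
    vectors_config=VectorParams(size=768, distance=Distance.COSINE),
    hnsw_config=HnswConfigDiff(m=32, ef_construct=256),
    quantization_config=ScalarQuantization(
        scalar=ScalarQuantizationConfig(type=ScalarType.INT8, quantile=0.99, always_ram=True),
    ),
)

# Metadata filtering: restrict the ANN search to a single data source.
hits = client.search(
    collection_name="trading_docs",
    query_vector=[0.0] * 768,  # placeholder query embedding
    query_filter=Filter(
        must=[FieldCondition(key="source", match=MatchValue(value="support_chat"))]
    ),
    limit=5,
)
```

The same concerns (graph connectivity, build-time search depth, quantization trade-offs, and payload filtering) apply across the listed stores, though each exposes them through its own API.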
Additional Qualifications
Chunking Strategy Architect: You don't just "split text." You implement Semantic Chunking and Parent-Child retrieval strategies to maximize LLM context relevance.
Cold/Warm/Hot Storage Strategy: Managing cost and latency by tiering data between Vector DBs (Hot), SQL/NoSQL (Warm), and S3/Data Lakes (Cold).
Privacy & Redaction Pipelines: Building automated PII (Personally Identifiable Information) redaction into the ingestion layer to ensure agents never "see" or "leak" sensitive user data (a minimal redaction sketch follows this list).
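Below is a minimal sketch of the kind of ingestion-time redaction step described above, using only Python's standard library. The patterns, labels, and function names are illustrative assumptions; a production pipeline would typically pair rules like these with an NER-based detector.

```python
# Sketch: rule-based PII redaction at ingestion. Patterns, labels, and function
# names are assumptions for the example, not a fixed specification.
import re

PII_PATTERNS = {
    "EMAIL": re.compile(r"\b[\w.+-]+@[\w-]+\.[\w.-]+\b"),
    "PHONE": re.compile(r"\+?\d{1,3}[\s-]?(?:\d[\s-]?){8,11}\d\b"),
    "PAN": re.compile(r"\b[A-Z]{5}\d{4}[A-Z]\b"),  # Indian PAN card format
}

def redact(text: str) -> str:
    """Replace detected PII spans with typed placeholders before embedding/indexing."""
    for label, pattern in PII_PATTERNS.items():
        text = pattern.sub(f"[{label}]", text)
    return text

if __name__ == "__main__":
    raw = "Customer jane.doe@example.com (PAN ABCDE1234F) called from +91 98765 43210."
    print(redact(raw))  # "Customer [EMAIL] (PAN [PAN]) called from [PHONE]."
```

Running redaction before embedding means the raw identifiers never reach the vector store, so downstream agents can only ever retrieve the placeholders.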