Free cookie consent management tool by TermsFeed AI Data Engineer | Antal Tech Jobs
Back to Jobs
1 Day ago

AI Data Engineer

decor
Hyderabad, Telangana, India
Information Technology
Full-Time
YO IT Consulting

Overview

Location- Hyderabad

Experience- 5 to 8 years

Must Have

Experience as a AI Data Engineer.

Experience in DVC (Data Version Control) and Airflow.

Experience with Apache Spark, Flink, and Kafka.

Experience in advanced level Python and AI logic and Rust (or C++).

Experience in Vector Database Mastery like configuration of HNSW indexes, scalar quantization, and metadata filtering strategies.

Role Overview

Seeking a hardcore, hands-on AI Data Engineer to build the high-performance data infrastructure required to power autonomous AI agents. You won't just be moving data from A to B; you will be architecting Dynamic Context Windows, managing Real-time Semantic Indexes, and building Self-Cleaning Data Pipelines that feed our "Super Employee" agents.

Key Responsibilities

Vector & Graph ETL: Design and maintain pipelines that transform unstructured data (PDFs, emails, logs, chats) into optimized embeddings for Vector Databases (Pinecone, Weaviate, Milvus).

Semantic Data Modeling: Engineer data structures that optimize for Retrieval-Augmented Generation (RAG), ensuring agents find the "needle in the haystack" in milliseconds.

Knowledge Graph Construction: Build and scale Knowledge Graphs (Neo4j) to represent complex relationships in our trading and support data that standard vector search misses.

Automated Data Labeling & Synthetic Data: Implement pipelines using LLMs to auto-label datasets or generate synthetic edge cases for agent training and evaluation.

Stream Processing for Agents: Build real-time data "listeners" (Kafka/Flink) that feed live context to agents, allowing them to react to market or support events as they happen.

Data Reliability & "Drift" Detection: Build monitoring for "Embedding Drift", identifying when the statistical distribution of your data changes and the agent's "knowledge" becomes stale.

Qualifications

Vector Database Mastery: Expert-level configuration of HNSW indexes, scalar quantization, and metadata filtering strategies within Pinecone, Milvus, or Qdrant.

Advanced Python & Rust: Proficiency in Python for AI logic and Rust (or C++) for high-performance data processing and custom embedding functions.

Big Data Ecosystem: Hands-on experience with Apache Spark, Flink, and Kafka in a high-throughput environment (Trading/FinTech preferred).

LLM Data Tooling: Deep experience with Unstructured.io, LlamaIndex, or LangChain for document parsing and chunking strategy optimization.

MLOps & DataOps: Mastery of DVC (Data Version Control) and Airflow/Prefect for managing complex, non-linear AI data workflows.

Embedding Models: Understanding of how to fine-tune embedding models (e.g., BGE, Cohere, or OpenAI) to better represent domain-specific (Trading) terminology.

Additional Qualifications

Chunking Strategy Architect: You don't just "split text." You implement Semantic Chunking and Parent-Child retrieval strategies to maximize LLM context relevance.

Cold/Warm/Hot Storage Strategy: Managing cost and latency by tiering data between Vector DBs (Hot), SQL/NoSQL (Warm), and S3/Data Lakes (Cold).

Privacy & Redaction Pipelines: Building automated PII (Personally Identifiable Information) redaction into the ingestion layer to ensure agents never "see" or "leak" sensitive user data

Share job
Similar Jobs
View All
1 Day ago
Lead Data Engineer - Artificial Intelligence/Machine Learning
Information Technology
  • Hyderabad, Telangana, India
DescriptionDuties & Responsibilities & Modeling : Develop and deploy time-series forecasting models (e.g., Prophet, ARIMA, DeepAR, LSTM, Temporal Fusion Transformer) to predict demand, revenue, and promotion lift. Apply advanced statistical and causa...
decor
1 Day ago
Java Developer - Spring Boot/Microservices Architecture
Information Technology
  • Hyderabad, Telangana, India
Job DescriptionJob Summary :We are looking for a skilled Java Developer to design, develop, and maintain scalable enterprise applications. The ideal candidate will have strong expertise in Java and modern backend technologies, with experience in buil...
decor
1 Day ago
Business Analyst 3
Information Technology
  • Hyderabad, Telangana, India
Comcast brings together the best in media and technology. We drive innovation to create the world's best entertainment and online experiences. As a Fortune 50 leader, we set the pace in a variety of innovative and fascinating businesses and create ca...
decor
1 Day ago
Cyber Security Analyst
Information Technology
  • Hyderabad, Telangana, India
Job Title: Cyber Security AnalystLocation: BangaloreExperience Required: 5–7 YearsEmployment Type: Full-TimeJob SummaryWe are looking for a highly skilled and detail-oriented *Security Analyst* with strong experience in SOC/NOC operations, threat mon...
decor
1 Day ago
Senior Data Engineer - Python/Spark
Information Technology
  • Hyderabad, Telangana, India
DescriptionDuties and Responsibilities : Design, build and test end to end data pipeline including data ingestion (streaming, events and batch), data integration, data curation Build and support data platform on the cloud Define and implement automat...
decor
1 Day ago
Junior Python Developer in Bangalore, Anantapur, Mysuru, Hyderabad, Delhi, Chennai, Gokarna, Udupi, Tumakuru, Andra
Information Technology
  • Hyderabad, Telangana, India
As a junior Python developer at Time Line Investments, you will have the opportunity to work on cutting-edge projects in the finance industry. You will be responsible for creating and maintaining Python applications, developing backend systems, and u...
decor
1 Day ago
Backend Java Developer
Information Technology
  • Hyderabad, Telangana, India
Backend Java Developer – Data Fabric / Platform EngineeringLocation: Pune (Hybrid)Employment: PermanentExperience: 4 to 8 yearsIf your idea of backend engineering is more than CRUD APIs and microservices boilerplate — this role is for you.We’re build...
decor
1 Day ago
Associate Lead Data Scientist - AI/ML Job
Information Technology
  • Hyderabad, Telangana, India
We use cookies to offer you the best possible website experience. Your cookie preferences will be stored in your browser’s local storage. This includes cookies necessary for the website's operation. Additionally, you can freely decide and change any ...
decor

Talk to us

Feel free to call, email, or hit us up on our social media accounts.
Social media