Senior GenAI Data Engineer
NTT America, Inc.
**Make an impact with NTT DATA**
Join a company that is pushing the boundaries of what is possible. We are renowned for our technical excellence and leading innovations, and for making a difference to our clients and society. Our workplace embraces diversity and inclusion – it’s a place where you can grow, belong and thrive.
**Your day at NTT DATA**
Senior GenAI Data Engineer
We are seeking an experienced Senior Data Engineer to join our team in delivering cutting-edge Generative AI (GenAI) solutions to clients. The successful candidate will be responsible for designing, developing, and deploying data pipelines and architectures that support the training, fine-tuning, and deployment of LLMs for various industries. This role requires strong technical expertise in data engineering, problem-solving skills, and the ability to work effectively with clients and internal teams.
**What you'll be doing**
**Key Responsibilities:**
+ **Design, develop, and manage** data pipelines and architectures to support GenAI model training, fine-tuning, and deployment
+ **Data Ingestion and Integration:** Develop data ingestion frameworks to collect data from various sources, transform, and integrate it into a unified data platform for GenAI model training and deployment.
+ **GenAI Model Integration:** Collaborate with data scientists to integrate GenAI models into production-ready applications, ensuring seamless model deployment, monitoring, and maintenance.
+ **Cloud Infrastructure Management:** Design, implement, and manage cloud-based data infrastructure (e.g., AWS, GCP, Azure) to support large-scale GenAI workloads, ensuring cost-effectiveness, security, and compliance.
+ **Write scalable, readable, and maintainable code** using object-oriented programming concepts in languages like Python, and utilize libraries like Hugging Face Transformers, PyTorch, or TensorFlow
+ **Performance Optimization:** Optimize data pipelines, GenAI model performance, and infrastructure for scalability, efficiency, and cost-effectiveness.
+ **Data Security and Compliance:** Ensure data security, privacy, and compliance with regulatory requirements (e.g., GDPR, HIPAA) across data pipelines and GenAI applications.
+ **Client Collaboration:** Collaborate with clients to understand their GenAI needs, design solutions, and deliver high-quality data engineering services.
+ **Innovation and R&D:** Stay up to date with the latest GenAI trends, technologies, and innovations, applying research and development skills to improve data engineering services.
+ **Knowledge Sharing:** Share knowledge, best practices, and expertise with team members, contributing to the growth and development of the team.
**Requirements:**
+ Bachelor’s degree in computer science, Engineering, or related fields (Master's recommended)
+ Experience with vector databases (e.g., Pinecone, Weaviate, Faiss, Annoy) for efficient similarity search and storage of dense vectors in GenAI applications
+ 5+ years of experience in data engineering, with a strong emphasis on cloud environments (AWS, GCP, Azure, or Cloud Native platforms)
+ Proficiency in programming languages like SQL, Python, and PySpark
+ Strong data architecture, data modeling, and data governance skills
+ Experience with Big Data Platforms (Hadoop, Databricks, Hive, Kafka, Apache Iceberg), Data Warehouses (Teradata, Snowflake, BigQuery), and lakehouses (Delta Lake, Apache Hudi)
+ Knowledge of DevOps practices, including Git workflows and CI/CD pipelines (Azure DevOps, Jenkins, GitHub Actions)
+ Experience with GenAI frameworks and tools (e.g., TensorFlow, PyTorch, Keras)
+ **Nice to have:**
+ Experience with containerization and orchestration tools like Docker and Kubernetes
+ Integrate vector databases and implement similarity search techniques, with a focus on GraphRAG is a plus
+ Familiarity with API gateway and service mesh architectures
+ Experience with low latency/streaming, batch, and micro-batch processing
+ Familiarity with Linux-based operating systems and REST APIs
**Location:** Delhi or Bangalore
**Workplace type** **:**
Hybrid Working
**About NTT DATA**
NTT DATA is a $30+ billion trusted global innovator of business and technology services. We serve 75% of the Fortune Global 100 and are committed to helping clients innovate, optimize and transform for long-term success. We invest over $3.6 billion each year in R&D to help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have diverse experts in more than 50 countries and a robust partner ecosystem of established and start-up companies. Our services include business and technology consulting, data and artificial intelligence, industry solutions, as well as the development, implementation and management of applications, infrastructure, and connectivity. We are also one of the leading providers of digital and AI infrastructure in the world. NTT DATA is part of NTT Group and headquartered in Tokyo.
**Equal Opportunity Employer**
NTT DATA is proud to be an Equal Opportunity Employer with a global culture that embraces diversity. We are committed to providing an environment free of unfair discrimination and harassment. We do not discriminate based on age, race, colour, gender, sexual orientation, religion, nationality, disability, pregnancy, marital status, veteran status, or any other protected category. Join our growing global team and accelerate your career with us. Apply today.
Confirm your E-mail: Send Email
All Jobs from NTT America, Inc.