HTGAJ Admin — AI Engineer (Generative AI & RAG Specialist)

Job Detail

AI Engineer (Generative AI & RAG Specialist)

DIMAAG · California, US

Data Science and AI Full–time

ID: #19700

Posted: 2026-03-06

Apply Link

Salary

—

Description

Company Description Dimaag is a leading design and technology company that specializes in AI solutions across multiple industry verticals including Smart Factory. Established in 2018 and headquartered in Silicon Valley with offices in Osaka, Japan, and Bangalore, India, Dimaag's EV business unit has a strong presence in deployed cutting edge industry solutions through its proprietary ENCORE ecosystem of EV components and charging solutions. Join Dimaag in its mission to create sustainable, high-performance technology for a better future. Role Description This is a full-time, on-site role for an AI Engineer (Generative AI & RAG Specialist) based in Fremont, CA. The AI Engineer will focus on building and optimizing state-of-the-art generative AI and retrieval-augmented generation (RAG) models. This individual will design and deploy scalable production systems using Large Language Models (LLMs). with a focus on building robust Retrieval-Augmented Generation (RAG) pipelines and optimizing transformer-based architectures to solve complex problems. Key Responsibilities Architect RAG Pipelines: Develop and optimize end-to-end RAG systems for multimodal data, including document parsing, embedding strategies, and vector database management. LLM Implementation: Select, fine-tune, and deploy LLMs (OpenAI, Anthropic, Llama, etc.) using frameworks like LangChain or LlamaIndex. Model Optimization: Work with transformer architectures to improve inference speed and accuracy (quantization, pruning, or prompt engineering). Data Engineering: Manage unstructured data workflows and high-dimensional vector search (e.g., Pinecone Qdrant, Weaviate, or Milvus). Required Skill Set Core AI: Deep understanding of the Transformer architecture (Attention mechanisms, encoders/decoders). Frameworks: Proficiency in PyTorch or TensorFlow, and orchestration tools like LangChain. Vector DBs: Hands-on experience with vector similarity search and indexing. Programming: Expert-level Python and experience with API integration. Deployment: Familiarity with cloud AI services (AWS Bedrock, GCP Vertex AI, or Azure AI). Preferred Qualifications Experience with fine-tuning techniques like LoRA or QLoRA. Contributions to open-source AI projects or research publications. Knowledge of evaluation frameworks for LLMs (e.g., RAGAS or TruLens).

Hard Skills 12

Skill	Source	Confidence
Vector Databases	llm_hard	100%
Large Language Models (LLMs)	llm_hard	100%
TensorFlow	llm_hard	100%
PyTorch	llm_hard	100%
Prompt Engineering	llm_hard	100%
Fine-tuning Models	llm_hard	100%
RAG (Retrieval-Augmented Generation)	llm_hard	100%
Model Optimization	llm_hard	100%
Python	llm_hard	100%
Azure ML	llm_hard	80%
Google Cloud AI	llm_hard	80%
AWS (SageMaker, EC2, S3)	llm_hard	80%

Soft Skills 11

Skill	Source	Confidence
Tech Savviness	llm_soft	100%
Digital Literacy	llm_soft	100%
Critical Thinking	llm_soft	80%
Analytical Thinking	llm_soft	80%
Creative Problem Solving	llm_soft	80%
Innovation	llm_soft	80%
Creative Thinking	llm_soft	80%
Research Skills	llm_soft	80%
Decision-Making	llm_soft	80%
Adapting to New Technology	llm_soft	80%
Problem-Solving	llm_soft	80%

Apply Options

Publisher	Direct	Link
LinkedIn	No	Apply
Jobright	No	Apply

API Logs for this Job

Query	Country	Status	Response ms	Created
AI Engineer (Generative AI & RAG Specialist)		extracted	5920	2026-03-28 10:58
AI Engineer (Generative AI & RAG Specialist)		classified	434	2026-03-28 10:24
machine learning engineer	gb	processed	16939	2026-03-28 10:08

Raw JSON

{
  "job_id": "GLjIb6nCF94zVt0aAAAAAA==",
  "job_city": null,
  "job_state": "California",
  "job_title": "AI Engineer (Generative AI & RAG Specialist)",
  "job_salary": null,
  "job_country": "US",
  "job_benefits": null,
  "job_latitude": 36.778261,
  "job_location": "California, United States",
  "job_onet_soc": "15111100",
  "apply_options": [
    {
      "is_direct": false,
      "publisher": "LinkedIn",
      "apply_link": "https://www.linkedin.com/jobs/view/ai-engineer-generative-ai-rag-specialist-at-dimaag-4381061455?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    },
    {
      "is_direct": false,
      "publisher": "Jobright",
      "apply_link": "https://jobright.ai/jobs/info/69ab40727e1fab39d382d902?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
    }
  ],
  "employer_logo": "https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQNxnNsAMK1r5uys0BhHz56I_fqlyC_cLw9uEQ9&s=0",
  "employer_name": "DIMAAG",
  "job_is_remote": false,
  "job_longitude": -119.4179324,
  "job_posted_at": "22 days ago",
  "job_publisher": "LinkedIn",
  "job_apply_link": "https://www.linkedin.com/jobs/view/ai-engineer-generative-ai-rag-specialist-at-dimaag-4381061455?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic",
  "job_highlights": {
    "Qualifications": [
      "Required Skill Set",
      "Core AI: Deep understanding of the Transformer architecture (Attention mechanisms, encoders/decoders)",
      "Frameworks: Proficiency in PyTorch or TensorFlow, and orchestration tools like LangChain",
      "Vector DBs: Hands-on experience with vector similarity search and indexing",
      "Programming: Expert-level Python and experience with API integration",
      "Deployment: Familiarity with cloud AI services (AWS Bedrock, GCP Vertex AI, or Azure AI)",
      "Experience with fine-tuning techniques like LoRA or QLoRA",
      "Contributions to open-source AI projects or research publications",
      "Knowledge of evaluation frameworks for LLMs (e.g., RAGAS or TruLens)"
    ],
    "Responsibilities": [
      "The AI Engineer will focus on building and optimizing state-of-the-art generative AI and retrieval-augmented generation (RAG) models",
      "This individual will design and deploy scalable production systems using Large Language Models (LLMs)",
      "with a focus on building robust Retrieval-Augmented Generation (RAG) pipelines and optimizing transformer-based architectures to solve complex problems",
      "Architect RAG Pipelines: Develop and optimize end-to-end RAG systems for multimodal data, including document parsing, embedding strategies, and vector database management",
      "LLM Implementation: Select, fine-tune, and deploy LLMs (OpenAI, Anthropic, Llama, etc.) using frameworks like LangChain or LlamaIndex",
      "Model Optimization: Work with transformer architectures to improve inference speed and accuracy (quantization, pruning, or prompt engineering)",
      "Data Engineering: Manage unstructured data workflows and high-dimensional vector search (e.g., Pinecone Qdrant, Weaviate, or Milvus)"
    ]
  },
  "job_max_salary": null,
  "job_min_salary": null,
  "job_description": "Company Description\n\nDimaag is a leading design and technology company that specializes in AI solutions across multiple industry verticals including Smart Factory. Established in 2018 and headquartered in Silicon Valley with offices in Osaka, Japan, and Bangalore, India, Dimaag's EV business unit has a strong presence in deployed cutting edge industry solutions through its proprietary ENCORE ecosystem of EV components and charging solutions. Join Dimaag in its mission to create sustainable, high-performance technology for a better future.\n\nRole Description\n\nThis is a full-time, on-site role for an AI Engineer (Generative AI & RAG Specialist) based in Fremont, CA. The AI Engineer will focus on building and optimizing state-of-the-art generative AI and retrieval-augmented generation (RAG) models. This individual will design and deploy scalable production systems using Large Language Models (LLMs). with a focus on building robust Retrieval-Augmented Generation (RAG) pipelines and optimizing transformer-based architectures to solve complex problems.\n\nKey Responsibilities\n\nArchitect RAG Pipelines: Develop and optimize end-to-end RAG systems for multimodal data, including document parsing, embedding strategies, and vector database management.\n\nLLM Implementation: Select, fine-tune, and deploy LLMs (OpenAI, Anthropic, Llama, etc.) using frameworks like LangChain or LlamaIndex.\n\nModel Optimization: Work with transformer architectures to improve inference speed and accuracy (quantization, pruning, or prompt engineering).\n\nData Engineering: Manage unstructured data workflows and high-dimensional vector search (e.g., Pinecone Qdrant, Weaviate, or Milvus).\n\nRequired Skill Set\n\nCore AI: Deep understanding of the Transformer architecture (Attention mechanisms, encoders/decoders).\n\nFrameworks: Proficiency in PyTorch or TensorFlow, and orchestration tools like LangChain.\n\nVector DBs: Hands-on experience with vector similarity search and indexing.\n\nProgramming: Expert-level Python and experience with API integration.\n\nDeployment: Familiarity with cloud AI services (AWS Bedrock, GCP Vertex AI, or Azure AI).\n\nPreferred Qualifications\n\nExperience with fine-tuning techniques like LoRA or QLoRA.\n\nContributions to open-source AI projects or research publications.\n\nKnowledge of evaluation frameworks for LLMs (e.g., RAGAS or TruLens).",
  "job_google_link": "https://www.google.com/search?ibp=htl;jobs&q&htidocid=GLjIb6nCF94zVt0aAAAAAA%3D%3D&hl=en-GB&shndl=37&shmd=H4sIAAAAAAAA_xWNMQrCQBBFsc0RrAYEUZGsCDZaBQ0hhY3BOqxx3KysM8vOIim9g6fwWp7EtfnFg_9e9hllZVFDScYSYoBZhYRBR_tESHwKp6KCxmNntbMS5_B9veHOFxDUoeuBCSpm43C862P0slVKxOVGYnJ0eccPxYQXHlQ6yX9a6XVA73TEdr1ZDbkns5gc6mORSpZgn0I3DmT1Es5kI16hSTKUH6XHyM6sAAAA&shmds=v1_ATWGeePBg_px-R8F447v3zPzy-psuIH7qUM1ORdN2ChzBV-i3Q&source=sh/x/job/li/m1/1#fpstate=tldetail&htivrt=jobs&htiq&htidocid=GLjIb6nCF94zVt0aAAAAAA%3D%3D",
  "employer_website": null,
  "job_onet_job_zone": "5",
  "job_salary_period": null,
  "job_apply_is_direct": false,
  "job_employment_type": "Full–time",
  "job_employment_types": [
    "FULLTIME"
  ],
  "job_posted_at_timestamp": 1772755200,
  "job_posted_at_datetime_utc": "2026-03-06T00:00:00.000Z"
}