—
Company Description Dimaag is a leading design and technology company that specializes in AI solutions across multiple industry verticals including Smart Factory. Established in 2018 and headquartered in Silicon Valley with offices in Osaka, Japan, and Bangalore, India, Dimaag's EV business unit has a strong presence in deployed cutting edge industry solutions through its proprietary ENCORE ecosystem of EV components and charging solutions. Join Dimaag in its mission to create sustainable, high-performance technology for a better future. Role Description This is a full-time, on-site role for an AI Engineer (Generative AI & RAG Specialist) based in Fremont, CA. The AI Engineer will focus on building and optimizing state-of-the-art generative AI and retrieval-augmented generation (RAG) models. This individual will design and deploy scalable production systems using Large Language Models (LLMs). with a focus on building robust Retrieval-Augmented Generation (RAG) pipelines and optimizing transformer-based architectures to solve complex problems. Key Responsibilities Architect RAG Pipelines: Develop and optimize end-to-end RAG systems for multimodal data, including document parsing, embedding strategies, and vector database management. LLM Implementation: Select, fine-tune, and deploy LLMs (OpenAI, Anthropic, Llama, etc.) using frameworks like LangChain or LlamaIndex. Model Optimization: Work with transformer architectures to improve inference speed and accuracy (quantization, pruning, or prompt engineering). Data Engineering: Manage unstructured data workflows and high-dimensional vector search (e.g., Pinecone Qdrant, Weaviate, or Milvus). Required Skill Set Core AI: Deep understanding of the Transformer architecture (Attention mechanisms, encoders/decoders). Frameworks: Proficiency in PyTorch or TensorFlow, and orchestration tools like LangChain. Vector DBs: Hands-on experience with vector similarity search and indexing. Programming: Expert-level Python and experience with API integration. Deployment: Familiarity with cloud AI services (AWS Bedrock, GCP Vertex AI, or Azure AI). Preferred Qualifications Experience with fine-tuning techniques like LoRA or QLoRA. Contributions to open-source AI projects or research publications. Knowledge of evaluation frameworks for LLMs (e.g., RAGAS or TruLens).
| Skill | Source | Confidence |
|---|---|---|
| Vector Databases | llm_hard |
100%
|
| Large Language Models (LLMs) | llm_hard |
100%
|
| TensorFlow | llm_hard |
100%
|
| PyTorch | llm_hard |
100%
|
| Prompt Engineering | llm_hard |
100%
|
| Fine-tuning Models | llm_hard |
100%
|
| RAG (Retrieval-Augmented Generation) | llm_hard |
100%
|
| Model Optimization | llm_hard |
100%
|
| Python | llm_hard |
100%
|
| Azure ML | llm_hard |
80%
|
| Google Cloud AI | llm_hard |
80%
|
| AWS (SageMaker, EC2, S3) | llm_hard |
80%
|
| Skill | Source | Confidence |
|---|---|---|
| Tech Savviness | llm_soft |
100%
|
| Digital Literacy | llm_soft |
100%
|
| Critical Thinking | llm_soft |
80%
|
| Analytical Thinking | llm_soft |
80%
|
| Creative Problem Solving | llm_soft |
80%
|
| Innovation | llm_soft |
80%
|
| Creative Thinking | llm_soft |
80%
|
| Research Skills | llm_soft |
80%
|
| Decision-Making | llm_soft |
80%
|
| Adapting to New Technology | llm_soft |
80%
|
| Problem-Solving | llm_soft |
80%
|
| Query | Country | Status | Response ms | Created |
|---|---|---|---|---|
| AI Engineer (Generative AI & RAG Specialist) | extracted | 5920 | 2026-03-28 10:58 | |
| AI Engineer (Generative AI & RAG Specialist) | classified | 434 | 2026-03-28 10:24 | |
| machine learning engineer | gb | processed | 16939 | 2026-03-28 10:08 |
{
"job_id": "GLjIb6nCF94zVt0aAAAAAA==",
"job_city": null,
"job_state": "California",
"job_title": "AI Engineer (Generative AI & RAG Specialist)",
"job_salary": null,
"job_country": "US",
"job_benefits": null,
"job_latitude": 36.778261,
"job_location": "California, United States",
"job_onet_soc": "15111100",
"apply_options": [
{
"is_direct": false,
"publisher": "LinkedIn",
"apply_link": "https://www.linkedin.com/jobs/view/ai-engineer-generative-ai-rag-specialist-at-dimaag-4381061455?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
},
{
"is_direct": false,
"publisher": "Jobright",
"apply_link": "https://jobright.ai/jobs/info/69ab40727e1fab39d382d902?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic"
}
],
"employer_logo": "https://encrypted-tbn0.gstatic.com/images?q=tbn:ANd9GcQNxnNsAMK1r5uys0BhHz56I_fqlyC_cLw9uEQ9&s=0",
"employer_name": "DIMAAG",
"job_is_remote": false,
"job_longitude": -119.4179324,
"job_posted_at": "22 days ago",
"job_publisher": "LinkedIn",
"job_apply_link": "https://www.linkedin.com/jobs/view/ai-engineer-generative-ai-rag-specialist-at-dimaag-4381061455?utm_campaign=google_jobs_apply&utm_source=google_jobs_apply&utm_medium=organic",
"job_highlights": {
"Qualifications": [
"Required Skill Set",
"Core AI: Deep understanding of the Transformer architecture (Attention mechanisms, encoders/decoders)",
"Frameworks: Proficiency in PyTorch or TensorFlow, and orchestration tools like LangChain",
"Vector DBs: Hands-on experience with vector similarity search and indexing",
"Programming: Expert-level Python and experience with API integration",
"Deployment: Familiarity with cloud AI services (AWS Bedrock, GCP Vertex AI, or Azure AI)",
"Experience with fine-tuning techniques like LoRA or QLoRA",
"Contributions to open-source AI projects or research publications",
"Knowledge of evaluation frameworks for LLMs (e.g., RAGAS or TruLens)"
],
"Responsibilities": [
"The AI Engineer will focus on building and optimizing state-of-the-art generative AI and retrieval-augmented generation (RAG) models",
"This individual will design and deploy scalable production systems using Large Language Models (LLMs)",
"with a focus on building robust Retrieval-Augmented Generation (RAG) pipelines and optimizing transformer-based architectures to solve complex problems",
"Architect RAG Pipelines: Develop and optimize end-to-end RAG systems for multimodal data, including document parsing, embedding strategies, and vector database management",
"LLM Implementation: Select, fine-tune, and deploy LLMs (OpenAI, Anthropic, Llama, etc.) using frameworks like LangChain or LlamaIndex",
"Model Optimization: Work with transformer architectures to improve inference speed and accuracy (quantization, pruning, or prompt engineering)",
"Data Engineering: Manage unstructured data workflows and high-dimensional vector search (e.g., Pinecone Qdrant, Weaviate, or Milvus)"
]
},
"job_max_salary": null,
"job_min_salary": null,
"job_description": "Company Description\n\nDimaag is a leading design and technology company that specializes in AI solutions across multiple industry verticals including Smart Factory. Established in 2018 and headquartered in Silicon Valley with offices in Osaka, Japan, and Bangalore, India, Dimaag's EV business unit has a strong presence in deployed cutting edge industry solutions through its proprietary ENCORE ecosystem of EV components and charging solutions. Join Dimaag in its mission to create sustainable, high-performance technology for a better future.\n\nRole Description\n\nThis is a full-time, on-site role for an AI Engineer (Generative AI & RAG Specialist) based in Fremont, CA. The AI Engineer will focus on building and optimizing state-of-the-art generative AI and retrieval-augmented generation (RAG) models. This individual will design and deploy scalable production systems using Large Language Models (LLMs). with a focus on building robust Retrieval-Augmented Generation (RAG) pipelines and optimizing transformer-based architectures to solve complex problems.\n\nKey Responsibilities\n\nArchitect RAG Pipelines: Develop and optimize end-to-end RAG systems for multimodal data, including document parsing, embedding strategies, and vector database management.\n\nLLM Implementation: Select, fine-tune, and deploy LLMs (OpenAI, Anthropic, Llama, etc.) using frameworks like LangChain or LlamaIndex.\n\nModel Optimization: Work with transformer architectures to improve inference speed and accuracy (quantization, pruning, or prompt engineering).\n\nData Engineering: Manage unstructured data workflows and high-dimensional vector search (e.g., Pinecone Qdrant, Weaviate, or Milvus).\n\nRequired Skill Set\n\nCore AI: Deep understanding of the Transformer architecture (Attention mechanisms, encoders/decoders).\n\nFrameworks: Proficiency in PyTorch or TensorFlow, and orchestration tools like LangChain.\n\nVector DBs: Hands-on experience with vector similarity search and indexing.\n\nProgramming: Expert-level Python and experience with API integration.\n\nDeployment: Familiarity with cloud AI services (AWS Bedrock, GCP Vertex AI, or Azure AI).\n\nPreferred Qualifications\n\nExperience with fine-tuning techniques like LoRA or QLoRA.\n\nContributions to open-source AI projects or research publications.\n\nKnowledge of evaluation frameworks for LLMs (e.g., RAGAS or TruLens).",
"job_google_link": "https://www.google.com/search?ibp=htl;jobs&q&htidocid=GLjIb6nCF94zVt0aAAAAAA%3D%3D&hl=en-GB&shndl=37&shmd=H4sIAAAAAAAA_xWNMQrCQBBFsc0RrAYEUZGsCDZaBQ0hhY3BOqxx3KysM8vOIim9g6fwWp7EtfnFg_9e9hllZVFDScYSYoBZhYRBR_tESHwKp6KCxmNntbMS5_B9veHOFxDUoeuBCSpm43C862P0slVKxOVGYnJ0eccPxYQXHlQ6yX9a6XVA73TEdr1ZDbkns5gc6mORSpZgn0I3DmT1Es5kI16hSTKUH6XHyM6sAAAA&shmds=v1_ATWGeePBg_px-R8F447v3zPzy-psuIH7qUM1ORdN2ChzBV-i3Q&source=sh/x/job/li/m1/1#fpstate=tldetail&htivrt=jobs&htiq&htidocid=GLjIb6nCF94zVt0aAAAAAA%3D%3D",
"employer_website": null,
"job_onet_job_zone": "5",
"job_salary_period": null,
"job_apply_is_direct": false,
"job_employment_type": "Full–time",
"job_employment_types": [
"FULLTIME"
],
"job_posted_at_timestamp": 1772755200,
"job_posted_at_datetime_utc": "2026-03-06T00:00:00.000Z"
}