- Accueil
- Emploi Télétravail
- Senior AI/ML Engineer, France
Déjà pourvu
Ne rate pas la prochaine offre. Reçois des postes adaptés directement par e-mail.
vector8
Senior AI/ML Engineer, France
Résumé du poste
Modèle de travail
Job Description
As a Senior AI/ML Engineer at vector8, you will design, implement, and deploy AI solutions that bridge the gap between research and production. Your work will focus on integrating and fine-tuning AI Models, optimizing the model performance, and ensuring enterprise-grade reliability, security, and scalability.
This Is a Hands-on Engineering Role Where You Will
- Develop and optimize LLM and VLM-powered solutions for enterprise use cases
- Develop and optimize TTS, STT and ML models.
- Apply software engineering best practices (testing, CI/CD, modular design, documentation)
- Collaborate with cross-functional teams (data engineers, MLOps, cloud architects, and business stakeholders)
- Solve real-world enterprise challenges (security, compliance, legacy system integration)
- Own the full lifecycle of AI models, from data exploration to production monitoring
You will work closely with vector8's Engineers and Project Managers to co-design AI foundations that enable organizations to scale AI from individual use cases to enterprise-wide capabilities.
The role is primarily based in Paris, with occasional travel to client sites and collaboration with teams across Europe.
Job Requirements
- 5+ years of experience in AI/ML engineering, software development, or a related field
- Expertise in LLM architectures and training methodologies:
- Transformers, attention mechanisms, fine-tuning, RAG, quantization
- Prompt engineering, model evaluation, bias detection
- Strong knowledge of machine learning architectures: fully connected, CNN, LSTM, transformers and classical ML models
- Strong software engineering skills:
- Proficient in Python (FastAPI, Pydantic, asyncio, type hints)
- Experience with API development
- Familiarity with modern tool chains (Docker, Kubernetes, Terraform)
- Hands-on experience with LLM integrations:
- LLM providers
- Vector databases (Pinecone, Weaviate, Milvus)
- Model serving (vLLM, TGI, KServe)
- Experience with MLOps and production deployments
- Understanding of enterprise challenges:
- Security, compliance, scalability, cost optimization.
- Experience with relational and non-relational databases.
- Strong problem-solving and debugging skills.
- Excellent communication and collaboration skills (fluent in English; German is a strong plus).
- Bachelor's or Master's degree in Computer Science, Mathematics, Physics, or a related field.
- Experience with multi-cloud environments (AWS, Azure, GCP).
- Experience with code optimization (e.g., model quantization, parallelization).
Job Responsibilities
End-to-End Model Development
- Design, implement, and deploy distributed, high-volume, high-performance, low-latency machine learning solutions, with a focus on GenAI models, and especially LLM integrations and API-driven architectures
- Take ownership of your models throughout their entire lifecycle:
- Data exploration and cleaning to build reproducible, versioned datasets
- State-of-the-art research to identify the best architectures for the problem (e.g., transformers, RAG, fine-tuning)
- Implementation, training, and optimization in reproducible environments
- Deployment, monitoring, and maintenance in production
- Optimize models for performance, latency, and cost efficiency, especially in LLM serving and inference
Software Engineering for AI
- Write clean, modular, and well-documented code in Python (FastAPI, Pydantic, asyncio)
- Apply best practices in:
- Testing (unit, integration, end-to-end)
- CI/CD (GitHub Actions, GitLab CI, ArgoCD)
- Observability (logging, monitoring, tracing)
- Ensure security and compliance (data protection, access controls, encryption)
- Integrate models and code into CI/CD pipelines for seamless deployment
AI & ML Integration & API Development
- Design and implement AI-powered solutions that integrate with APIs, microservices, and event-driven architectures
- Develop and optimize AI pipelines for:
- Dataset cleaning, preprocessing and model training
- Fine-tuning (domain adaptation, instruction tuning)
- Retrieval-Augmented Generation (RAG) (vector databases, semantic search)
- Prompt engineering (optimizing inputs for performance, cost, and accuracy)
- Model evaluation (benchmarking, bias detection, drift analysis)
- Build scalable, secure, and cost-efficient serving infrastructure (e.g., FastAPI, vLLM)
- Debug and optimize performance (latency, throughput, token efficiency for Transformer based architectures)
Enterprise AI & MLOps
- Deploy and monitor AI models in production
- Design and implement MLOps pipelines for:
- Model training, fine-tuning, and evaluation
- Model versioning and lineage tracking
- A/B testing and canary deployments
- Ensure scalability and reliability (auto-scaling, fault tolerance, disaster recovery)
- Collaborate with data engineers to build data pipelines (batch, streaming, real-time)
Collaboration & Technical Leadership
- Work closely with product owners, DevOps, and quality assurance in an agile, cross-functional team
- Mentor junior engineers and promote best practices in AI/ML and software engineering
- Translate product requirements into technical solutions and architectural decisions
- Document architectures, decisions, and best practices for internal and client-facing use
- Develop relationships with internal and external stakeholders, including clients and partners
Innovation & Continuous Improvement
- Stay ahead of the latest AI and ML architectures (transformers, Mixture of Experts, sparse attention).
- Experiment with cutting-edge techniques (quantization, distillation, speculative decoding).
- Evaluate and benchmark open-source and proprietary models (Llama, Mistral, Mixtral, GPT-4, Claude).
- Bring your own ideas through vector8's ideation process.
- Contribute to vector8's AI accelerators (reusable components for common industry problems).
- Embrace a strategic and continuous improvement mentality to drive innovation.
Job Benefits
- A great compensation package with competitive benefits, including:
- Flexible working hours, including remote work options (hybrid model).
- 25 days of paid vacation per year, plus additional flex days.
- Private health and life insurance & pension plan for long-term security.
- Home office allowance & Lunch vouchers to enjoy meals on us.
- Discounted fitness memberships to stay active.
- 50% reimbursement of public transport costs.
- Free coffee, fruit, and snacks to keep you fueled.
- Access to the latest technologies (LangDock, Claude Code for developers)
- Grants for training, coaching, and conferences to support your continuous learning.
- Opportunities to attend industry events and represent vector8 as a thought leader.
- A less-formal work environment where authenticity and collaboration thrive.
- A diverse and inclusive team that values curiosity, ownership, and innovation.