As Senior Manager, LLMOps & AI Integration Engineer (Individual Contributor), you will lead model integration, agent/tooling, and LLM operations strategy for our pip‑installable AI Capabilities SDK. You will design and maintain clean, extensible interfaces for LLMs, embeddings, retrieval, tool execution, and agentic workflows; build runtime guards and evaluation hooks; and optimize latency, reliability, and cost so that teams can embed AI safely and predictably.
WHY THIS ROLE MATTERS
LLMOps sits at the intersection of capability, safety, and operability. By turning complex model and agent behaviors into well-abstracted, evaluable, and observable SDK modules, you enable teams to ship AI features quickly and safely, reducing bespoke implementations, ensuring reproducibility, and controlling latency and cost at scale. Your work is foundational for the platform services that rely on the SDK as their backbone.
ROLE RESPONSIBILITIES
1) Model & Agent Integration Architecture
- Design Python SDK interfaces for LLM clients, embeddings, tokenization, and structured outputs (e.g., Pydantic/JSON schemas); a sketch follows this list.
- Implement function/tool calling abstractions, agent orchestration patterns (ReAct-style, planner/executor), and MCP adapters for interoperable tools.
- Provide configuration and secrets integration points that downstream services can adopt consistently (env vars, config schemas, key management hooks).
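To illustrate the kind of interface contract this area implies, here is a minimal sketch assuming Pydantic v2 is available; `LLMClient`, `Completion`, `TicketTriage`, and `complete_structured` are hypothetical names, not an existing API:

```python
from typing import Protocol

from pydantic import BaseModel


class Completion(BaseModel):
    """Normalized result shape returned by any provider backend."""
    text: str
    input_tokens: int
    output_tokens: int


class LLMClient(Protocol):
    """Backend-agnostic contract that every provider adapter implements."""

    def complete(self, prompt: str, *, max_tokens: int = 512) -> Completion: ...


class TicketTriage(BaseModel):
    """Example structured-output schema enforced on model responses."""
    severity: str
    summary: str


def complete_structured(client: LLMClient, prompt: str, schema: type[BaseModel]) -> BaseModel:
    """Request JSON matching `schema` and validate it before returning."""
    raw = client.complete(
        f"{prompt}\nRespond only with JSON matching this schema: {schema.model_json_schema()}"
    )
    return schema.model_validate_json(raw.text)
```

Validating structured outputs at the SDK boundary keeps downstream services from parsing free-form model text themselves.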
2) Retrieval & RAG Components
- Build RAG modules (document loaders, chunking/segmentation, embeddings, retrievers, rerankers) with pluggable backends (vector stores, search indices).
- Standardize connectors and interface contracts for common data sources; ensure reproducible pipelines and behavioral parity between the SDK and consuming services.
- Optimize retrieval quality via evaluation hooks (precision/recall, MRR, hit rate) and tunable parameters (top‑k, thresholds).
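As one concrete example of such an evaluation hook, a mean-reciprocal-rank check might look like the sketch below; the `Retriever` protocol and `mrr` helper are illustrative, assuming retrieval returns document IDs:

```python
from typing import Protocol, Sequence


class Retriever(Protocol):
    """Pluggable backend contract (vector store, search index, ...)."""

    def retrieve(self, query: str, *, top_k: int = 5) -> list[str]: ...


def mrr(retriever: Retriever, queries: Sequence[str], relevant_ids: Sequence[str], *, top_k: int = 5) -> float:
    """Mean reciprocal rank over (query, relevant document ID) pairs."""
    total = 0.0
    for query, doc_id in zip(queries, relevant_ids):
        hits = retriever.retrieve(query, top_k=top_k)
        rank = next((i + 1 for i, hit in enumerate(hits) if hit == doc_id), None)
        total += 1.0 / rank if rank else 0.0
    return total / max(len(queries), 1)
```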
3) Guardrails, Safety & Policy Integration
- Codify safety guardrails: prompt hygiene, jailbreak resistance, content filters, sensitive topic/policy checks, red‑team/adversarial probes.
- Integrate privacy/security controls: data minimization, PII detection flags, safe logging, and deterministic truncation strategies.
- Provide policy‑as‑code hooks and pre/post‑processing middleware to enforce runtime constraints within SDK flows.
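The middleware idea can be sketched as a pair of check chains around the model call; `GuardedPipeline` and `PolicyViolation` are hypothetical names:

```python
from dataclasses import dataclass, field
from typing import Callable

# A check either returns (possibly redacted) text or raises PolicyViolation.
Check = Callable[[str], str]


class PolicyViolation(Exception):
    """Raised when a guardrail check blocks a prompt or response."""


@dataclass
class GuardedPipeline:
    """Runs policy checks before and after a model call."""
    pre_checks: list[Check] = field(default_factory=list)
    post_checks: list[Check] = field(default_factory=list)

    def run(self, prompt: str, call_model: Callable[[str], str]) -> str:
        for check in self.pre_checks:
            prompt = check(prompt)        # e.g. PII redaction, prompt hygiene
        response = call_model(prompt)
        for check in self.post_checks:
            response = check(response)    # e.g. content filter, policy check
        return response
```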
4) Performance, Reliability & Cost Engineering
- Implement caching, batching, retry/backoff, circuit breakers, and fallback strategies across model/tool calls (sketched after this list).
- Profile and tune latency, throughput, and token usage/cost; expose configuration knobs and budget guards (token and cost caps).
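A minimal sketch of two of these mechanisms, retry with exponential backoff and a token budget guard; `TransientError`, `TokenBudget`, and `BudgetExceeded` are illustrative names:

```python
import random
import time
from typing import Callable


class TransientError(Exception):
    """Hypothetical marker for retryable failures (timeouts, rate limits)."""


def with_retries(call: Callable[[], str], *, attempts: int = 3, base_delay: float = 0.5) -> str:
    """Retry a model/tool call with exponential backoff plus jitter."""
    for attempt in range(attempts):
        try:
            return call()
        except TransientError:
            if attempt == attempts - 1:
                raise
            time.sleep(base_delay * 2 ** attempt + random.uniform(0, 0.1))
    raise AssertionError("unreachable")


class BudgetExceeded(Exception):
    """Raised when a request exceeds its token cap."""


class TokenBudget:
    """Hard cap on tokens spent across a request's model calls."""

    def __init__(self, cap: int) -> None:
        self.cap, self.spent = cap, 0

    def charge(self, tokens: int) -> None:
        self.spent += tokens
        if self.spent > self.cap:
            raise BudgetExceeded(f"spent {self.spent} of {self.cap} tokens")
```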
5) Observability‑by‑Design for Libraries
- Embed telemetry hooks (structured logs, metrics, traces) suitable for a library, so that services consuming the SDK can attach enterprise observability (sketched below).
- Define SLIs relevant to LLMOps (latency, error rates, cache hit ratio, token usage, cost per call) and document runbook guidance for downstream teams.
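For a library, "telemetry hooks" typically means emitting to a null-handled logger and letting services register callbacks; a minimal sketch, with `ai_sdk` as a placeholder package name:

```python
import logging
from typing import Callable

# Library convention: emit log records but attach no real handlers;
# the consuming service decides where they go.
logger = logging.getLogger("ai_sdk")  # placeholder package name
logger.addHandler(logging.NullHandler())

MetricHook = Callable[[str, float, dict[str, str]], None]  # name, value, tags

_metric_hooks: list[MetricHook] = []


def register_metric_hook(hook: MetricHook) -> None:
    """Called once by a consuming service to route SDK metrics into its
    own observability stack (OpenTelemetry, StatsD, ...)."""
    _metric_hooks.append(hook)


def emit_metric(name: str, value: float, **tags: str) -> None:
    """Used at SDK call sites wherever an SLI is produced, e.g.
    emit_metric("llm.latency_ms", 182.0, provider="a", cache_hit="false")."""
    for hook in _metric_hooks:
        hook(name, value, tags)
```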
6) Evaluation & CI/CD Integration
- Partner with Evaluation & QA Engineers to build test harnesses (unit/integration/contract/fuzz/adversarial) and benchmark suites for agents/tools, RAG, and model endpoints.
- Work with DevOps Engineers to wire quality gates into CI/CD: coverage thresholds, performance budgets, safety checks, SBOM/signing for release artifacts.
- Provide fixtures and simulators to enable deterministic tests across backends (vector stores, providers).
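A deterministic backend simulator might look like the sketch below (assuming pytest; `InMemoryVectorStore` and its toy word-overlap scorer are illustrative):

```python
import pytest


class InMemoryVectorStore:
    """Deterministic stand-in for a vector-store backend in tests."""

    def __init__(self) -> None:
        self.docs: dict[str, str] = {}

    def add(self, doc_id: str, text: str) -> None:
        self.docs[doc_id] = text

    def retrieve(self, query: str, *, top_k: int = 5) -> list[str]:
        # Toy scorer: rank documents by word overlap with the query.
        words = set(query.lower().split())
        ranked = sorted(
            self.docs,
            key=lambda d: -len(words & set(self.docs[d].lower().split())),
        )
        return ranked[:top_k]


@pytest.fixture
def vector_store() -> InMemoryVectorStore:
    store = InMemoryVectorStore()
    store.add("doc-1", "reset your password from the account page")
    return store


def test_retriever_contract(vector_store: InMemoryVectorStore) -> None:
    assert vector_store.retrieve("how do I reset my password", top_k=1) == ["doc-1"]
```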
7) Developer Experience & Enablement
- Collaborate with Developer Experience and Technical Writers to deliver quickstarts, sample apps, notebooks, and code recipes for common patterns (RAG, agents, tool use, structured outputs).
- Contribute to cookiecutters/CLI scaffolds that prewire configuration, tests, telemetry, and guardrails for new modules or integrations.
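A CLI scaffold along these lines could be as small as the sketch below; the directory layout and `DEFAULTS` contents are illustrative, not a prescribed structure:

```python
import argparse
from pathlib import Path

TEST_STUB = "def test_placeholder():\n    assert True\n"


def scaffold(name: str, root: Path) -> None:
    """Create a new SDK module directory with prewired config and test stubs."""
    module = root / name
    module.mkdir(parents=True, exist_ok=True)
    (module / "__init__.py").write_text("")
    (module / "config.py").write_text("DEFAULTS = {'top_k': 5, 'token_cap': 4096}\n")
    (module / f"test_{name}.py").write_text(TEST_STUB)


if __name__ == "__main__":
    parser = argparse.ArgumentParser(description="Scaffold a new SDK module.")
    parser.add_argument("name")
    args = parser.parse_args()
    scaffold(args.name, Path("src"))
```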
8) Collaboration & Continuous Improvement
- Close feedback loops with Embedded Architecture, Solution Design, AI Engineering, Creation Centers, and FITs to refine APIs, defaults, and ergonomics.
- Participate in post‑release retrospectives and incident reviews to strengthen guardrails, performance tuning, and developer experience.
MEASURES OF SUCCESS
- Performance & cost: Latency and token/cost budgets consistently met; cache hit ratios improved; fewer performance regressions.
- Safety & reliability: Guardrail tests pass; reduced policy violations and incident rates; predictable error handling.
- Adoption & DX: Faster time‑to‑first‑success for consuming teams; positive developer feedback on APIs and examples.
QUALIFICATIONS
Basic Qualifications
Preferred Qualifications
Work Location Assignment: Hybrid