Data Engineer (AI track)

Build and scale the next generation of product security.

Location

Remote

Job Type

Full-Time

Experience Level

Mid-Senior

We’re looking for a Data Engineer who’s hungry to grow, especially into the world of AI. If you’re driven, curious, and looking to make the leap from data infrastructure into machine learning and large language models (LLMs), this is your chance.

You’ll work closely with our founding team to develop systems that extract, transform, and model complex codebases into graph-based architectural representations. Long term, you’ll help us push boundaries in semantic code understanding, inference models, and secure-by-design AI tooling.

Responsibilities

  • Build inference pipelines: Develop LLM-powered workflows to analyze, search, and improve codebases and natural language documents.
  • Implement AI data infrastructure: Design, manage, and operate data pipelines to support AI-powered features.
  • Optimize AI systems: Lead the design and implementation of systems to fine-tune, benchmark, and optimize LLMs for security use cases.
  • Integrate with modern platforms: Connect with tools like Google Docs, Confluence, Jira, and GitHub, and process PDFs, to ingest development artifacts.

Required Skills & Experience

  • 5+ years of experience building scalable data systems or data-intensive applications
  • Advanced Python proficiency, particularly with asynchronous programming (asyncio)
  • Experience designing and deploying data pipelines in cloud environments (AWS preferred)
  • Proficiency in SQL (PostgreSQL) and experience with vector databases (e.g., pgvector)
  • Experience with AI model APIs (OpenAI, Anthropic, Hugging Face) and embedding-based retrieval
  • Familiarity with performance and scalability considerations in data systems (e.g., caching and orchestration)
  • Comfortable with containerized development (Docker) and infrastructure-as-code (Terraform)
  • Solid grasp of modern engineering practices: CI/CD, testing, and agile workflows

Bonus Qualifications

  • Experience with LLM-related concepts: fine-tuning, prompt engineering, hallucination reduction
  • Exposure to semantic code analysis, AST parsing, or graph-based code modeling
  • Familiarity with serverless architectures (e.g., AWS Lambda)
  • Contributions to open-source AI or data infrastructure projects

About DevArmor

DevArmor is a venture-backed startup founded by repeat entrepreneurs and security veterans with successful exits. Backed by top-tier investors and advised by leaders from Sierra.ai, Databricks, Netflix, Mozilla, and Semgrep, we're reimagining how secure software gets built.

As an Okta executive puts it:

"DevArmor bakes security into the earliest design decisions—exactly what our industry needs to stay ahead. The team brings deep experience and real empathy for developers. I’m excited to support their mission to make security a first-class part of building software."

Suchit Agarwal

Director of Engineering at Okta

Why Join DevArmor?

  • Tackle real challenges: Help solve the hard problem of scaling security without slowing down dev teams.
  • Work with the best: Join a small, expert team in security, AI, and dev tools.
  • Own your work: Shape the product, roadmap, and culture from day one.
  • Build the future: Work at the frontier of AI, code analysis, and secure-by-design systems.
  • We’ve got your back: Competitive pay, great benefits, and flexible work.