ALESIUM ← PROPOSAL INVEST Start →
◆ EST. 2008 AS PROPOSAL INVEST◆ CONTINUING 2021 AS ALESIUM◆ LIMASSOL · CY◆ DEPOSITPHOTOS + CRELLO → CIMPRESS 2021◆ OPEN FOR Q3 ENGAGEMENTS◆ HIRING: SENIOR ENGINEERS◆ ANTIFRAUD · MARKETPLACES · REAL ESTATE · R&D◆ EST. 2008 AS PROPOSAL INVEST◆ CONTINUING 2021 AS ALESIUM◆ LIMASSOL · CY◆ DEPOSITPHOTOS + CRELLO → CIMPRESS 2021◆ OPEN FOR Q3 ENGAGEMENTS◆ HIRING: SENIOR ENGINEERS◆ ANTIFRAUD · MARKETPLACES · REAL ESTATE · R&D◆ EST. 2008 AS PROPOSAL INVEST◆ CONTINUING 2021 AS ALESIUM◆ LIMASSOL · CY◆ DEPOSITPHOTOS + CRELLO → CIMPRESS 2021◆ OPEN FOR Q3 ENGAGEMENTS◆ HIRING: SENIOR ENGINEERS◆ ANTIFRAUD · MARKETPLACES · REAL ESTATE · R&D◆ EST. 2008 AS PROPOSAL INVEST◆ CONTINUING 2021 AS ALESIUM◆ LIMASSOL · CY◆ DEPOSITPHOTOS + CRELLO → CIMPRESS 2021◆ OPEN FOR Q3 ENGAGEMENTS◆ HIRING: SENIOR ENGINEERS◆ ANTIFRAUD · MARKETPLACES · REAL ESTATE · R&D
↖ JOBS / HOOH

Applied AI Engineer (Document Intelligence & Prompting)

◆ UNITED STATES / REMOTE ◆ FULL-TIME, REMOTE ◆ HOOH

About

Hooh is an AI-first startup turning cutting-edge models into real products. We love turning research into usable tools that solve real problems. If you thrive on ownership and impact — you’ll fit right in.

We’re looking for an Applied AI Engineer to make messy documents reliably understandable — combining accurate structure and entity extraction with strong retrieval and cost-efficient model orchestration.

Key responsibilities

  • Document analysis & taxonomy — analyse layouts, sections, entities, and relations; define categories, types, and metadata schemas that generalise across domains
  • Prompting & schemas — author, version, and maintain prompts/chains for classification, extraction, summarisation, and Q&A; enforce JSON/JSON-Schema outputs
  • Retrieval (RAG) — build hybrid retrieval (embeddings + keyword + graph traversal); manage embedding generation, indexing, deduplication, and drift
  • Cascades & MoE — architect model cascades and MoE routing to balance quality, cost, and latency
  • Agentic orchestration — coordinate multi-step pipelines with caching, batching, streaming, and robust retries
  • Result fusion — combine outputs via reranking, voting, and confidence/consistency checks; implement guardrails and safety rules
  • Evaluation & monitoring — run evals (LangSmith or similar), define golden sets, automate regressions, track quality/cost/latency; canary and observe
  • Operate models locally or hosted — vLLM / TGI / Ollama / llama.cpp and API models; apply quantisation/LoRA when useful
  • Data stores — vector DBs plus graph and relational DBs (Qdrant, Weaviate, Milvus, pgvector, Neo4j, PostgreSQL)

Requirements

  • Strong programming in Python or JavaScript/TypeScript; clear, tested, maintainable code
  • One LLM-powered feature shipped end-to-end (prototype → production), or equivalent open-source/portfolio work
  • Hands-on with RAG, vector embeddings, and evaluation (offline + online A/B, error analysis)
  • Familiarity with model cascading / MoE concepts (or ability to learn quickly)
  • Practical database skills; comfortable with SQL and with graph or vector systems
  • Product mindset with a bias to measure impact and iterate

Interested? Drop us a line.

◆ APPLICATIONS ARE PROCESSED ONLY THROUGH THE FORM ABOVE