Back to all positions

    RL AI Research Scientist

    ResearchRemote (US/Singapore Preferred)Full-time

    Design, implement, and scale novel reinforcement learning algorithms that form the core of Pokee's AI agent platform.

    About the Role

    As an RL Research Scientist, you will design, implement, and scale novel reinforcement learning algorithms that form the core of Pokee’s AI agent platform. You’ll work at the frontier of RL applied to real-world enterprise tasks—developing methods for context selection, long-horizon planning, and reward shaping that enable agents to operate reliably at scale.

    What You'll Do

    • Design and implement novel RL algorithms for training AI agents on complex, multi-step enterprise workflows
    • Develop and refine reward modeling, context selection, and policy optimization techniques that improve agent accuracy over extended task horizons
    • Run large-scale experiments, analyze results rigorously, and translate research findings into production-ready components
    • Collaborate closely with infrastructure engineers to ensure research prototypes scale efficiently on both cloud and on-device hardware
    • Contribute to the company’s intellectual property through publications, patents, and open-source contributions
    • Stay current with the latest advances in RL, LLM fine-tuning, and AI agent architectures, and propose new research directions

    What We're Looking For

    Required

    • PhD (or equivalent research experience) in Reinforcement Learning, Machine Learning, or a closely related field
    • Strong publication record at top venues (NeurIPS, ICML, ICLR, AAAI, or equivalent)
    • Deep expertise in RL fundamentals: policy gradient methods, value-based methods, model-based RL, multi-agent RL, or RLHF/RLAIF
    • Proficiency in Python and at least one deep learning framework (PyTorch strongly preferred)
    • Experience training and fine-tuning large language models is a significant plus
    • Demonstrated ability to take research from prototype to production

    Bonus Points

    • Experience with on-device or edge inference optimization (quantization, distillation, MoE architectures)
    • Familiarity with enterprise software deployment, compliance, or regulated industries
    • Track record of open-source contributions in RL or LLM ecosystems
    • Experience with distributed training at scale (FSDP, DeepSpeed, Megatron)

    Who You Are

    You want to join a small, elite team solving one of the hardest problems in AI—building agents that actually work in the real world. You’ll have direct impact on the product, access to cutting-edge research, and the opportunity to shape the future of enterprise AI from the ground up.

    Apply for RL AI Research Scientist

    Ready to join our team? Fill out the form below to apply.

    How did you hear about this opportunity? (Select all that apply)
    Follow on LinkedIn
    Pokee Logo

    Pokee AI

    Frontier Agent Deployed in Your Infrastructure.

    Solutions

    FinanceHealth CareE-commerceEducationManufacturing

    Company

    CareersSecurityContact Us

    Resources

    API DocumentationBlogFAQ

    Legal

    Terms of ServicePrivacy PolicyAccessibilitySystem Status

    Follow Us

    TwitterLinkedInRedditDiscord

    © 2026 Pokee AI. All rights reserved.

    Terms & ConditionsPrivacy Policy
    Pokee in CB Insights AI 100Pokee named to CB Insights' 10th Annual AI 100 ListRead more→
    Pokee AI
    • FinanceCompliance, onboarding, document review
    • Health CareClinical workflows, EHR, care coordination
    • E-commerceCatalog, support, post-purchase ops
    • EducationAdmissions, grading, student support
    • ManufacturingPredictive maintenance, supply chain
    Finance
    Health Care
    E-commerce
    Education
    Manufacturing
    Request a Demo
    • Security
    • Deployment
    • Agent Infrastructure
    • FAQ
    • Blog
    • Playbook
    • Integrations
    • Skills Marketplace
    • Developer Program
    • Pricing
    • Careers
    • Contact
    Demo
    Enterprise Demo
    • Finance
    • Health Care
    • E-commerce
    • Education
    • Manufacturing
    • Security
    • Deployment
    • Agent Infrastructure
    • FAQ
    • Blog
    • Playbook
    • Integrations
    • Skills Marketplace
    • Developer Program
    • Pricing
    • Careers
    • Contact
    Request a Demo