    RL AI Research Scientist

    Research | Remote (US/Singapore Preferred) | Full-time

    Design, implement, and scale novel reinforcement learning algorithms that form the core of Pokee's AI agent platform.

    About This Position

    As an RL Research Scientist, you will design, implement, and scale novel reinforcement learning algorithms that form the core of Pokee’s AI agent platform. You’ll work at the frontier of RL applied to real-world enterprise tasks—developing methods for context selection, long-horizon planning, and reward shaping that enable agents to operate reliably at scale.

    What You'll Do

    • Design and implement novel RL algorithms for training AI agents on complex, multi-step enterprise workflows
    • Develop and refine reward modeling, context selection, and policy optimization techniques that improve agent accuracy over extended task horizons
    • Run large-scale experiments, analyze results rigorously, and translate research findings into production-ready components
    • Collaborate closely with infrastructure engineers to ensure research prototypes scale efficiently on both cloud and on-device hardware
    • Build the company’s intellectual property through publications, patents, and open-source work
    • Stay current with the latest advances in RL, LLM fine-tuning, and AI agent architectures, and propose new research directions

    What We're Looking For

    Required Qualifications

    • PhD (or equivalent research experience) in Reinforcement Learning, Machine Learning, or a closely related field
    • Strong publication record at top venues (NeurIPS, ICML, ICLR, AAAI, or equivalent)
    • Deep expertise in RL fundamentals: policy gradient methods, value-based methods, model-based RL, multi-agent RL, or RLHF/RLAIF
    • Proficiency in Python and at least one deep learning framework (PyTorch strongly preferred)
    • Experience training and fine-tuning large language models is a significant plus
    • Demonstrated ability to take research from prototype to production

    Preferred Qualifications

    • Experience with on-device or edge inference optimization (quantization, distillation, MoE architectures)
    • Familiarity with enterprise software deployment, compliance, or regulated industries
    • Track record of open-source contributions in RL or LLM ecosystems
    • Experience with distributed training at scale (FSDP, DeepSpeed, Megatron)

    Who You Are

    You want to join a small, elite team solving one of the hardest problems in AI—building agents that actually work in the real world. You’ll have direct impact on the product, access to cutting-edge research, and the opportunity to shape the future of enterprise AI from the ground up.
