About This Position

As an RL Research Scientist, you will design, implement, and scale novel reinforcement learning algorithms that form the core of Pokee’s AI agent platform. You’ll work at the frontier of RL applied to real-world enterprise tasks—developing methods for context selection, long-horizon planning, and reward shaping that enable agents to operate reliably at scale.

What You Will Do

Design and implement novel RL algorithms for training AI agents on complex, multi-step enterprise workflows
Develop and refine reward modeling, context selection, and policy optimization techniques that improve agent accuracy over extended task horizons
Run large-scale experiments, analyze results rigorously, and translate research findings into production-ready components
Collaborate closely with infrastructure engineers to ensure research prototypes scale efficiently on both cloud and on-device hardware
Contribute to the company’s intellectual property through publications, patents, and open-source contributions
Stay current with the latest advances in RL, LLM fine-tuning, and AI agent architectures, and propose new research directions

Candidate Profile

Required Qualifications

PhD (or equivalent research experience) in Reinforcement Learning, Machine Learning, or a closely related field
Strong publication record at top venues (NeurIPS, ICML, ICLR, AAAI, or equivalent)
Deep expertise in RL fundamentals: policy gradient methods, value-based methods, model-based RL, multi-agent RL, or RLHF/RLAIF
Proficiency in Python and at least one deep learning framework (PyTorch strongly preferred)
Experience training and fine-tuning large language models is a significant plus
Demonstrated ability to take research from prototype to production

Preferred Qualifications

Experience with on-device or edge inference optimization (quantization, distillation, MoE architectures)
Familiarity with enterprise software deployment, compliance, or regulated industries
Track record of open-source contributions in RL or LLM ecosystems
Experience with distributed training at scale (FSDP, DeepSpeed, Megatron)

Who We Are Looking For

You want to join a small, elite team solving one of the hardest problems in AI—building agents that actually work in the real world. You’ll have direct impact on the product, access to cutting-edge research, and the opportunity to shape the future of enterprise AI from the ground up.

RL AI Research Scientist
