LOOKING FOR A NEW ADVENTURE?

Speedinvest and our portfolio of startups are always hiring exceptional talent!
Browse open jobs below to find your next career move.
226
companies
1,178
Jobs

Founding Engineer: Reinforcement Learning and Intelligent Strategization

Atman Labs

Atman Labs

London, UK
Posted on Tuesday, January 23, 2024

About Atman Labs
At Atman Labs we are building software to emulate proactive human expertise. Emulating true human intelligence and interaction with deep knowledge and proactive assistance has largely been impossible to do via standalone Artificial Intelligence techniques. Our unique research lies at the intersection of custom Reinforcement Learning environments, Large Scale Knowledge Representation, and Knowledge Presentation, is inspired by biological priors, and will be evolved and compounded with commercial application. As an applied research and commercialization company, we are deploying our platform in products across a number of commercial domains to demonstrate the value of our approach – starting with building proactive shopping concierges for e-commerce, to eventually launching products across travel, healthcare, education and more.

The Next Frontier of Reinforcement Learning Applications: Emulating Human Expertise and Consciousness
We are hiring for a founding engineer that will be responsible for advancing the state of the art in reinforcement-learning as it relates to building proactive agent interactions with humans that mimic the intelligence and dynamism of human experts. As a specialist in reinforcement learning rooted in first-principled, biological thinking, you are able to discern that biological systems have infinite sequences of actions and rewards that can be represented by a custom RL environment, regardless of whether perfect information is present. While we can emulate basic human interaction through a simply-defined environment, you realize and are excited by the fact that eventually the action space will be highly complex, hierarchical, and span many sub-actions that aren’t defined clearly.

You will solve cutting-edge problems that connect reinforcement learning rewards to knowledge presentation and recommendation modules, build complex and hierarchical environments from scratch to emulate specific expert actions, and allow agents to formulate intelligent strategies to determine the sequences of those actions that drive human engagement in both the short-term and on a longer-term horizon.

About You
We are looking for ambitious and independent thinkers who have a deep desire to contribute and want to be part of the team that makes this a reality for humanity. You should have all of these qualities:

  • You have a PhD or equivalent industrial expertise in the application of reinforcement learning with knowledge of the limitations and frontiers of the field.

  • You have demonstrated expertise in designing, testing, and deploying end-to-end reinforcement learning solutions, and can help set the guardrails for both experimentation and production deployment.

  • You are deeply proficient in various policy- or value-based RL methods, from Proximal Policy Optimization (PPO), Deep Q Networks (DQN), and/or Monte Carlo methods, and can strategically experiment with and select between various methods to fit the current situation or intended behavior.

  • You have 7+ years of programming experience in Python and have development experience with both ML toolkits and RL environments like OpenAI Gymnasium. You are equally capable as a software engineer as you are in formulating novel research ideas and your code proves it.

  • You are capable of reasoning from first-principles, where there is no trodden path, as well as critically evaluate when existing ideas are worth considering.

  • You are articulate and can present your ideas in writing, in person, and in small groups, and are able to educate audiences at all levels on the novel applications and relevance of reinforcement learning.

  • You can easily distinguish authentic and high integrity thinkers from ‘posers’, while also critically evaluating truth from fiction in your own work.

  • Your colleagues consider you a highly positive personality, you amplify the energy of others rather than dampen the mood.

  • Your intensity goes from 0 to 1000 when you become authentically interested in a topic.

  • You not only have interests in reinforcement learning, but are deeply curious about a range of interdisciplinary topics, ranging from knowledge graphs, recommendations, web-scale search, deep learning, generative AI models, computer vision and the opportunity to build truly intelligent systems in software that are inspired by biology.

  • You can show high creativity and intensity in your personal pursuits, and your intelligence, creativity, and motivation is not limited to only one discipline.

  • You consider yourself an innovator and an original thinker, not a follower. You are looking for a way to contribute to the world, and want to join our team to do so.

  • You want to work in person in London. Don’t worry, we’ll sponsor your visa.