About Atman Labs
At Atman Labs we are building software to emulate proactive human expertise. Emulating true human intelligence and interaction with deep knowledge and proactive assistance has largely been impossible to do via standalone Artificial Intelligence techniques. Our unique research lies at the intersection of custom Reinforcement Learning environments, Large Scale Knowledge Representation, and Knowledge Presentation, is inspired by biological priors, and will be evolved and compounded with commercial application. As an applied research and commercialization company, we are deploying our platform in products across a number of commercial domains to demonstrate the value of our approach – starting with building proactive shopping concierges for e-commerce, to eventually launching products across travel, healthcare, education and more.

The Next Frontier of Generative Models: Unlocking Human Engagement and New Consumer Behavior with Advanced Multi-Modal Interactions
We are hiring for a research engineer responsible for implementing and fine-tuning an ensemble of state-of-the-art image generation models such as GANs, diffusion models, and text generation models like LLMs. Human imagination is critical as the connective tissue between stored knowledge in the brain and vocalization in speech, using media like language, art, and sound to share and receive ideas. Our intelligent agents have a number of ways they can interact with the world and other humans – they can imagine and communicate not just with language, but also by synthesizing immersive images that serve the task at hand.

As such, we seek to explore the frontiers of how agents can generate and present information to humans across images, text, and video from stored knowledge, going beyond what human imagination can achieve in form factor and latency. We can also create new visual experiences to interact with agents, such as video avatars.

You will be responsible for the cutting edge deployment and fine-tuning of generation algorithms across multiple modalities: image, video, voice, and text. You will be familiar with ways that generative algorithms can be more performant and fine-tuned to specific knowledge and use cases. You will have technical experience or intuition to work with generative models with some understanding of the current state of the art across GANs, Diffusion models, NeRFs, Language and Visual Foundation Models, excited to imagine the cutting-edge of the interaction layer our agents have with the world.

About You
We are looking for ambitious and independent thinkers who have a deep desire to contribute and want to be part of the team that makes this a reality for humanity. You should have all of these qualities:

You have a PhD in Computer Science or equivalent Industrial expertise or practical in the training, deployment, and fine-tuning of visual and language generative models, including GANs, NeRFs, diffusion models and LLMs.
You have a strong creative imagination and high visual and linguistic editorial standards for presentation of generated objects, which complements your technical intuition. You are both a hacker and a painter.
You are eager to process and analyze large amounts of multimodal data.
You are intimately familiar with the nuances for fine-tuning generative models on knowledge, resolution, presentation, and computational efficiency across visual and language dimensions and are excited about deploying them into product experiences.
You understand and can critically communicate about state-of-the art tools and frameworks to optimize model performance, including DeepSpeed, LoRA, 3D parallelism, or quantization.
You have 7+ years of programming experience in Python and have development experience with both DL toolkits like PyTorch or Tensorflow and can deploy models with clean APIs. You are equally capable as a software engineer as you are in formulating novel research ideas and your code proves it.
You are capable of reasoning from first-principles, where there is no trodden path, as well as critically evaluate when existing ideas are worth considering.
You are articulate and can present your ideas in writing, in person, and in small groups, and are able to educate audiences at all levels on the novel applications and relevance of reinforcement learning.
You can easily distinguish authentic and high integrity thinkers from ‘posers’, while also critically evaluating truth from fiction in your own work.
Your colleagues consider you a highly positive personality, you amplify the energy of others rather than dampen the mood.
Your intensity goes from 0 to 1000 when you become authentically interested in a topic.
You not only have interests in reinforcement learning, but are deeply curious about a range of interdisciplinary topics, ranging from knowledge graphs, recommendations, web-scale search, deep learning, generative AI models, computer vision and the opportunity to build truly intelligent systems in software that are inspired by biology.
You can show high creativity and intensity in your personal pursuits, and your intelligence, creativity, and motivation is not limited to only one discipline.
You consider yourself an innovator and an original thinker, not a follower. You are looking for a way to contribute to the world, and want to join our team to do so.
You want to work in person in London. Don’t worry, we’ll sponsor your visa.

This job is no longer accepting applications

See open jobs at Atman Labs.See open jobs similar to "Founding Engineer: Visual Synthesis and Generative Algorithms" Speedinvest.

See more open positions at Atman Labs

Privacy policy Cookie policy