Research Scientist - Voice AI Foundations

Overview
Job Description

Deepgram is a foundational AI company building state of the art, production-ready AI models that streamline human-computer interaction and amplify productivity. By enabling seamless communication between humans and machines, we believe we can harness the untapped potential of AI and help pave the way for a more productive future. We passionately believe in the potential of audio data to transform lives, businesses, and interactions across the globe – which is why Deepgram is trusted by well-respected companies like NASA, Twilio, Auth0, and Spotify to push the boundaries of what is possible in voice technology!

The Opportunity:

At Deepgram, we spend every day tackling big, real-world challenges in voice. Our customers hire us to solve their hardest problems, taking real, complex audio and transforming it into novel insights. And to raise the bar, everything we build needs scale in its DNA. We aren’t content with simple horizontal scaling: we intend to replace entire data centers dedicated to speech analytics with a single rack of servers. These challenges demand creativity and innovative problem-solving every day.

As a Research Scientist at Deepgram, you’ll have the freedom to explore and uncover breakthroughs. You’ll also have a mandate to build — applying the latest advancements in deep learning to develop accurate and performant voice AI models.  You will collaborate with product & engineering to help deploy these models in the most scalable speech API on the planet.  We look forward to you bringing your whole self to work, sharing learnings from your latest experiments, and collaborating with us to advance the state of AI and voice technology.

The Role:

Deepgram is currently looking for an experienced Research Scientist who has worked extensively on building models to solve hard problems in voice AI domains including automatic speech recognition (ASR), text-to-speech (TTS), diarization and speaker identification, language detection, or code switching. Voice AI is a challenging problem space which involves dealing with raw audio waveforms generated by the human voice.  The complexity of audio data poses unique infrastructure, engineering, and modeling challenges which are orders of magnitude more difficult than working with text. You should have extensive experience working on the hard technical aspects around deep learning for audio such as speech data curation and characterization, development of expressive and efficient neural network architectures for speech, distributed training at large-scales, and optimization of speech models for inference at scale.

What You’ll Do:

  • Stay up to date with the latest advances in deep learning with a particular eye towards their implications and applications within our products.
  • Design and carry out experimental programs to build new voice AI models that solve critical problems for our customers.
  • Drive large-scale training jobs successfully on distributed computing infrastructure.
  • Optimize model architecture to make them as fast and memory-efficient as possible; deploy new models into production for use at massive scale.
  • Document and present results and complex technical concepts clearly for internal and external audiences

You’ll Love This Role If You:

  • Are passionate about AI and excited about working on state of the art speech research
  • Enjoy building from the ground up and love to create new systems from scratch
  • Are obsessed with building and shipping practical solutions to real world problems
  • Are data-driven and prefer to solve problems using iterative experimentation
  • Have strong communication skills and are able to translate complex concepts in simple terms, depending on the target audience

It’s Important To Us That You Have:

  • Prior industry experience in building deep learning models to solve audio problems, with a solid understanding toward the applications and implications of different neural network types, architectures, and loss mechanisms.
  • Proven experience building models from a blank page and owning the entire deep learning stack including data curation, characterization and cleaning, architecture design and model building, distributed large-scale training, and model optimization for inference.
  • Strong software engineering skills with particular emphasis on developing clean, modular code in Python and working with Pytorch.
  • Prior experience in designing and conducting experimental programs with the ability to rapidly iterate and change course as needed.

It Would Be Great if You Had:

  • Deep understanding and experience working with state-of-the-art network architectures including transformers.
  • Experience building generative audio models for speech or music synthesis.
  • Understanding of different parallelism paradigms for efficient distributed training.
  • Up-to-date knowledge of recent techniques and developments in multiple voice AI problem domains (ASR, TTS, diarization, etc.)

Backed by prominent investors including Y Combinator, Madrona, Tiger Global, Wing VC and NVIDIA, Deepgram has raised over $85 million in total funding after closing our Series B funding round last year. If you’re looking to work on cutting-edge technology and make a significant impact in the AI industry, we’d love to hear from you!

Deepgram is an equal opportunity employer. We want all voices and perspectives represented in our workforce. We are a curious bunch focused on collaboration and doing the right thing. We put our customers first, grow together and move quickly. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, gender identity or expression, age, marital status, veteran status, disability status, pregnancy, parental status, genetic information, political affiliation, or any other status protected by the laws or regulations in the locations where we operate.

Receive jobs from , on your whatsapp

Stay up to date with job opportunities, directly on your WhatsApp!
Receive instant notifications about the latest job openings in your region

Anúncio

Related Jobs

Related Jobs to Research Scientist - Voice AI Foundations

Freelance English Transcriptionist (AMER/EMEA)
273 days ago

We are seeking a skilled and experienced Freelance Transcriptionist to join our team. As a Freelance Transcrip...

Technical Writer - Security
278 days ago

We are looking for a Technical Writer to join our tech client’s commercial enterprise services team and wr...

Manager, Communications - People Team
296 days ago

We are seeking a dynamic and experienced Communication Manager to join our team and lead all internal team mem...

Senior or Staff Frontend Engineer - React
295 days ago

We’re looking for an experienced full-time (or part-time) Frontend Software Engineer to join our engin...

Clinical Care Navigator
296 days ago

Lyra is transforming mental health care through technology with a human touch to help people feel emotional...

HR Business Partner
286 days ago

Headway’s mission is a big one – to build a new mental health care system everyone can access. We’v...

Healthcare Customer Service Representative
283 days ago

We’re obsessed with growth. From enabling companies to flourish, to helping careers bloom. SupportNinja wa...

Business Analyst
262 days ago

Piper Companies is seeking a Business Analyst to join a global investment company out of Wayne, PA. This...

Account Executive
280 days ago

Adentro was created to solve one of the most important problems in the modern economy—the majority of consu...

Customer Experience and Insights Manager, Enterprise
284 days ago

Upwork ($UPWK) is the world’s work marketplace. We serve everyone from one-person startups to large, Fortune...

Software Engineer, Platform
297 days ago

Branch is on a mission to help working Americans grow financially. We do this by helping companies accelerate ...

Senior Product Manager
279 days ago

Are you passionate about building products from scratch? Are you ready to manage a product that influences h...

Back-end Developer (Node.js)
270 days ago

The IT сompany Andersen invites a Back-end Developer (Node.js) to work on a large-scale project for our USA ...

Senior CRM Manager
267 days ago

Discord is about giving people the power to create space to find belonging in their lives. We want to make it ...

Account Executive
279 days ago

We believe time is our most precious resource and our mission is to help leaders shift their time from things ...

Account Executive, Enterprise
291 days ago

We are looking for an Account Executive, Enterprise to join our dynamic team of creatives, engineers, market...

Customer Support Frontline Specialist
265 days ago

We are Semrush, a global IT company developing our own product – a platform for digital marketers. New sta...

Principal SEO
222 days ago

We are looking for a Principal SEO to guide the strategy for the company’s organic growth. This role involve...

Consumer Support Specialist
Consumer Support Specialist
236 days ago

At PNC, our people are our greatest differentiator and competitive advantage in the markets we serve. We are a...

Account Executive, Mid Market
205 days ago

We are looking for a seasoned Mid-Market Account Executive to continue driving LeadIQ’s suite of capabilitie...

Enterprise Account Executive
296 days ago

From the very beginning, SugarCRM had a unique vision: to offer a different kind of Customer Relationship Man...

Software Developer
Software Developer
240 days ago

At PNC, our people are our greatest differentiator and competitive advantage in the markets we serve. We are a...

Principal Site Reliability Engineer
296 days ago

Hi, we’re DuckDuckGo, the Internet privacy company for everyone who wants to take back their privacy now. Fo...

Software Engineer
296 days ago

Argyle is a fast-growing, remote-first Series B startup solving a systemic data problem. Underneath the consum...