About You

You are an experienced Data Engineer who can help us advance our automatic speech recognition (ASR) and is excited to build the voice interfaces of the future. You will be working with millions of hours of audio and billions of words to support our models which train across dozens of GPUs. It’s all about finding the bottlenecks across our 30+ languages and targeting our efforts to understand every voice. We want our pipelines to scale with more data as we start training billion parameter models.

An average day might include working on:

  • Investigating how emotional state affects transcription accuracy on the Friends TV show!
  • Scraping data for dozens of new languages by crawling thousands of webpages
  • Training PyTorch neural networks on new accent data and analysing the predictions
  • Improving and extending our model building pipelines and how we store artefacts and metadata
  • Partnering with new external vendors to improve our data acquisition

We aim to get you onboarded and started on something like this in your first few days. In addition, having a very collaborative culture, you will often be pair programming with another team member on a new ETL pipeline project, reviewing other folks’ code and suggesting new ways to tackle a tough memory usage problem as well as brainstorming novel approaches for analysing model predictions with the team.

You’ll want to join our team if you:

  • Are results driven, like moving fast and keeping things simple
  • Love working in collaborative and ambitious teams
  • Have a growth mindset and love to develop yourself and others
  • Enjoy solving challenging problems and digging into a stack of unfamiliar code
  • Can quickly manipulate and evaluate copious amounts of data
  • Are a code optimising guru

You will have experience in some of the following:

  • ETL pipelines in Python (we like Prefect and Airflow)
  • Distributed computing (for example, shell scripting with HPC grids and Spark clusters)
  • Working with speech or text data in industry
  • Deep learning and Pytorch
  • Speaking multiple languages or have a linguistics background

What we can offer you:

Speechmatics is a collective team of ambitious, problem solvers and thought-leaders paving the way for inclusion in speech recognition technology 🗣🎙.

No matter what stage of your career you're at - from paid internships and first-job opportunities through to management and senior positions - we'll support you with the training and development 🏋️ needed to reach your career aspirations with us. There really is no shortage of opportunities here for you to get involved and collaborate with those around you to deliver your best work 📈.

When you become part of the Speechmatics Team we work hard to make sure you do your best work with us 💪, while also having a good time doing it 😆. With our Focus Fridays you get an undisturbed day of focus 🧘‍♀️, offset with Together Tuesdays when we have our team meetings 👫. 

We offer incredibly flexible working 🤸, regular company lunches, and birthday celebrations🎉. But that's not all. We've spoken to our teams to find out what they want. From Private Medical 🏥 and Dental 🦷 for you and your family, through to global working opportunities 🌎, a generous holiday allowance 🏝 and pension/401K matching 🪺, we want to make sure our employees and their families are looked after. Every employee will receive a working from home allowance for tech or home office equipment (on top of your choice of laptop/ Mac, screen and accessories of course) 🧑‍💻!

We support people to work wherever they work best. But we also understand the importance of coming together to collaborate, socialise and build relationships. Individuals and teams are free to decide what works for them.

Who we are:

Speechmatics are global experts in deep learning and speech recognition, providing technology that understands every voice. We have built the most accurate and inclusive speech-to-text engine available 🗣🎙, which is now working with an amazing mix of global companies 🌎. We have recently raised $62 million at Series B and continue to grow positively 🌻.

Joining us means working with some of the smartest minds around the world 🤯, focused on cutting-edge projects and deploying the latest techniques to disrupt the market. We believe in putting people first 🥰; we’ll do all we can to help you develop your skills and give you the tools you need to thrive 📈. We support people to work wherever they work best and also understand the importance of coming together to collaborate, socialise and build relationships 🙌.

This is only the beginning; we’re looking for amazing people like you to continue our journey… 🚀

At Speechmatics, our mission has always been to ‘Understand Every Voice’.

We believe that recruiting talent with diverse experiences, perspectives and backgrounds encourages people to think differently and be more creative.

We welcome difference whether it’s gender, gender identity or expression, race, disability, age, sexual orientation, religion or belief, marital status, national origin, veteran status, or pregnancy and maternity status; so please be yourself!

For more information on usplease visit our website and follow Speechmatics on our social channels via Twitter, Facebook, LinkedIn, and YouTube.

We rely on legitimate interest as a legal basis for processing personal information under the GDPR for purposes of recruitment and applications for employment. 
#LI-Hybrid

Apply for this Job

* Required