ZoomInfo
Senior Data Scientist – Information Retrieval & NLP
Job Description
At ZoomInfo, we encourage creativity, value innovation, demand teamwork, expect accountability and cherish results. We value your take charge, take initiative, get stuff done attitude and will help you unlock your growth potential. One great choice can change everything. Thrive with us at ZoomInfo.
ZoomInfo is redefining how 40,000+ revenue teams find, engage, and win customers. Our next leap forward: lightning-fast, hyper-accurate information retrieval powered by Large & Small Language Models. We’re assembling a best-in-class Applied AI group and are hiring a Senior Data Scientist to own core retrieval, NER, and aligned entity-resolution & knowledge-graph initiatives that touch billions of records and serve millions of daily queries.
What you will do:
- End-to-End Retrieval Modeling
- Invent and productionize Transformer/RAG architectures that surface the right contact, company, or insight.
- Drive quantization, distillation, and SLM fine-tuning so models stay fast and affordable at petabyte scale.
- Prototype and launch hybrid dense/sparse retrieval pipelines on vector DBs (Pinecone, Weaviate, FAISS, OpenSearch).
- Named-Entity Recognition & Resolution
- Own high-recall NER models that tag people, orgs, locations, and industry-specific entities across multi-language text.
- Build cross-dataset entity-resolution frameworks that dedupe hundreds of millions of records with sub-second latency; enrich with knowledge-graph signals where valuable.
- Experimentation
- Design large-scale A/B and back-testing plans; close the loop from experiment to KPI uplift.
- Cross-Functional Impact
- Translate product goals into measurable ML KPIs; influence roadmap, capacity, and investment decisions.
- Mentor junior scientists/engineers; publish internal requirements documents, external blogs, and present at conferences.
What you will bring:
- 7+ yrs hands-on ML/NLP experience (or 4+ yrs post-PhD/Master’s) with at least two delivered, revenue-impacting products.
- Expertise in transformer stacks (BERT/GPT/T5), RAG, vector-based IR, and latency/throughput optimization.
- Proven track record building NER or entity-resolution systems at 100M+ record scale; knowledge-graph experience is a plus.
- Strong applied research chops (PyTorch or TensorFlow) paired with software-engineering rigor (Python, Go/Java a plus).
- Desire to work within MLOps tools and frameworks: Docker, K8s, GitOps, Terraform, feature stores, model registries, automated retraining.
- Ability to persuade exec and non-tech audiences with data-driven storytelling; comfortable owning strategy & budget.
#LI-SK
#LI-Remote
Actual compensation offered will be based on factors such as the candidate’s work location, qualifications, skills, experience and/or training. Your recruiter can share more information about the specific salary range for your desired work location during the hiring process. We want our employees and their families to thrive.
In addition to comprehensive benefits we offer holistic mind, body and lifestyle programs designed for overall well-being. Learn more about ZoomInfo benefits here.
About us:
ZoomInfo (NASDAQ: ZI) is the Go-To-Market Intelligence Platform that empowers businesses to grow faster with AI-ready insights, trusted data, and advanced automation. Its solutions provide more than 35,000 companies worldwide with a complete view of their customers, making every seller their best seller.
ZoomInfo may use a software-based assessment as part of the recruitment process. More information about this tool, including the results of the most recent bias audit, is available here.
ZoomInfo is proud to be an equal opportunity employer, hiring based on qualifications, merit, and business needs, and does not discriminate based on protected status. We welcome all applicants and are committed to providing equal employment opportunities regardless of sex, race, age, color, national origin, sexual orientation, gender identity, marital status, disability status, religion, protected military or veteran status, medical condition, or any other characteristic protected by applicable law. We also consider qualified candidates with criminal histories in accordance with legal requirements.