Data Scientist, Research Data Insights
Job Description
We are Digital Science and we are advancing the research ecosystem.Â
Dimensions, part of the Digital Science family, is the world’s largest linked research information dataset, covering millions of research publications and connected by more than 1.3 billion citations. We are shaping the future of research and are looking for a Data Scientist to join the team.Â
As part of a dynamic team environment you will support our global customers through the development of new analytic approaches and capabilities leveraging our scientometric data sets and emerging knowledge graph ecosystem. You will help our customers, including the largest funding and research organizations in the U.S. Federal government and beyond, to more effectively manage their multi-billion dollar research portfolios by providing delivery excellence that delights our customers, fuels word-of-mouth growth, and very high renewal rates. You will leverage our data and platforms, including Dimensions and the rest of Digital Sciences portfolio to support research assessment, portfolio management/analysis, strategic planning and more.
- Conduct large-scale, quantitative data analysis (millions of records) potentially including custom indexing, data linking, data collection and other data wrangling using Dimensions in-house data assets and external or customer data sets as required.
- Leverage Large Language Models and other AI technologies to address customer analytical needs, identifying opportunities to incorporate these tools into analytic workflows and customer facing applications.
- Plan, design, maintain and document data integrations, pipelines, internal use utilities, tools and software packages to support our advanced analytic capabilities.
- Build machine-learning models that operate on large, text-based documents (10s - 100s of millions of documents), for a variety of applications including named entity resolution, relationship extraction, document clustering and topic modeling.
- Create and deploy visualizations and interactive web-based dashboards, using tools such as Plotly, Dash, and React.
- You will have a good understanding of the S&T ecosystem - funders, research organizations, scientific publishing and related experience working with bibliometric/scientometric datasets such as scientific publications, grants, and patents.
- You will have familiarity with knowledge graphs (including technologies such as RDF and SPARQL). Ideally, you will experience building and querying knowledge graphs in support of analytic workloads leveraging bibliometric/scientometric data sets.
- You will have experience in Python, including relevant Python libraries and modules such as pandas, scikit learn, gensim, transformers, pyTorch and Dash.
- You will have familiarity with commercial AI models like GPT, Bard or Palm and ideally experience working with LLM support toolkits such as LangChain, Guidance, and Haystack.
- You’ll be experienced in Natural Language Processing and machine learning methods with bibliometric/scientometric datasets.
- You will have experience with data visualization tools (Plotly, D3, matplotlib etc)
- You will thrive in an environment where you can work independently and remotely
- You will have previous experience of working globally and across multiple teams
- You will be a strong communicator and able to communicate your findings to a varied audience through written and verbal presentation
- You will have 3-5 years of experience delivering customer solutions.
Current US Public Trust clearance preferred, as applicants will be subject to a security investigation and will need to meet eligibility requirements for access to sensitive information.Â
We're an equal opportunity employer. All applicants will be considered for employment without attention to race, colour, religion, sex, sexual orientation, gender identity, national origin, veteran or disability status.
Date Posted
10/20/2024
Views
0
Similar Jobs
Data Scientist - Capstone Integrated Solutions
Views in the last 30 days - 0
Capstone Integrated Solutions promotes itself as a customerfocused provider offering comprehensive software services and seeks a Data Scientist with e...
View DetailsEngineering Manager - Software Supply Chain Security: Auth Infrastructure - GitLab
Views in the last 30 days - 0
This job description highlights a leadership role in developing secure scalable authentication infrastructure for GitLab It emphasizes technical exper...
View DetailsStaff Salesforce Engineer - CRM Systems - GitLab
Views in the last 30 days - 0
This job description outlines a Staff Salesforce Developer role focusing on designing building and scaling enterprisegrade solutions across Salesforce...
View DetailsGrowth Product Lead - Loyalty - Trafilea
Views in the last 30 days - 0
Trafilea promotes itself as a transformative consumer tech platform with AIdriven growth solutions highlighting achievements like 1B revenue and globa...
View DetailsSales Prospecting Account Executive - Financial Solutions - Blackbaud
Views in the last 30 days - 0
This job posting seeks Prospect Account Executives to sell Financial Management applications for nonprofits and governments Responsibilities include s...
View DetailsSolutions Architect - phData
Views in the last 30 days - 0
This job posting seeks a Solutions Architect to join phDatas Elastic Platform Operations team focusing on cloudnative data platforms like Snowflake AW...
View Details