Semantic Models of Structured Data Internship

Remote: Full Remote
Experience: Entry-level / graduate
Work from: California (USA), United States

Offer summary

Qualifications:

  • Enrolled in a BS/MS/PhD program.
  • Interest in Generative AI methods.
  • Familiarity with Knowledge Graphs.
  • Technical skills in Linux, Python, and SQL.

Key responsibilities:

  • Assist ongoing research on Semantic Labeling.
  • Contribute to building a Knowledge Graph.
Teradata XLarge https://www.teradata.com
10001 Employees

Job description

Our Company

At Teradata, we believe that people thrive when empowered with better information. That’s why we built the most complete cloud analytics and data platform for AI. By delivering harmonized data, trusted AI, and faster innovation, we uplift and empower our customers and our customers’ customers to make better, more confident decisions. The world’s top companies across every major industry trust Teradata to improve business performance, enrich customer experiences, and fully integrate data across the enterprise.

Our Internship Program

Our summer internship program lasts 10-12 weeks beginning in May/June and ending in August/September. We offer a fast-paced, flexible, fun environment where you will have the opportunity to work on meaningful projects and face new challenges every day.

Location

Our program is fully virtual. Interns must remain in the US for the duration of the internship.

What You'll Do

  • Assist ongoing research by senior Teradata staff into integrating Semantic Labeling of structured data with Retrieval-Augmented (query) Generation (RAG) tools.
  • We have an ongoing three-year project whose goal is to discover the "Semantics" (structure, rules, features) of a Structured Data Corpus. The internship will focus on using this Semantic Map to automatically build a Knowledge Graph, which the RAG tool sets can then use to better support "Natural Language" data analysis.

There are three ways we will measure success in this project:

  • Product Impact - Teradata has a team dedicated to RAG work. We would like to (positively) impact their product development schedule.
  • Patent Impact - We would like to be able to submit Intellectual Disclosure Reports (IDRs) that Teradata can file as patents.
  • Paper Impact - We would like to be able to write up the results of the work for academic publication (SIGMOD, VLDB, etc.).

Who You'll Work With

  • The intern will report to Paul G. Brown (Teradata Engineering Fellow) and work alongside a small number (one to two) of Teradata employees (TBD) from the Research Group at Teradata. In addition to its focus on Knowledge Graph discovery, the project team will work with the core Teradata Engineering Team responsible for the company's RAG features and functionality.

What Makes You a Qualified Candidate

  • Must be enrolled in a BS/MS/PhD program, with a graduation date between December 2025 and June 2027.
  • Background of study and interest in the use of Generative AI Methods in support of data analysis (RAG).
  • Familiarity with Knowledge Graphs and their applications in search / query / analysis, specifically in the context of Structured Repositories. That is, we're trying to address requirements for automatic support of RAG in the context of Structured Data Lakes, rather than bodies of unstructured text, images, etc.
  • A technical skillset that includes Linux command line tools (sed, awk, bash), Python (Jupyter Notebooks), and SQL. In addition, we will be using git for project management / source code control.
  • Must be in the US for the duration of the internship.

What You’ll Bring

  • A "research" mindset: we are not certain whether, or how, what we plan to achieve is possible, but we're convinced that we have a couple of good ideas to start with.
  • Experience with Extensible SQL (writing and using User-Defined Extensions to a SQL DBMS) in C, Java, etc. would be handy.
  • An appreciation of Graph Theory would be of significant benefit.

Pay Rate: $29.00 - $44.00 - $50.00 Hourly



Why We Think You’ll Love Teradata

We prioritize a people-first culture because we know our people are at the very heart of our success. We embrace a flexible work model because we trust our people to make decisions about how, when, and where they work. We focus on well-being because we care about our people and their ability to thrive both personally and professionally. We are an anti-racist company because our dedication to Diversity, Equity, and Inclusion is more than a statement. It is a deep commitment to doing the work to foster an equitable environment that celebrates people for all of who they are.

Teradata invites all identities and backgrounds in the workplace. We work with deliberation and intent to ensure we are cultivating collaboration and inclusivity across our global organization. We are proud to be an equal opportunity and affirmative action employer. We do not discriminate based upon race, color, ancestry, religion, creed, sex (including pregnancy, childbirth, breastfeeding, or related conditions), national origin, sexual orientation, age, citizenship, marital status, disability, medical condition, genetic information, gender identity or expression, military and veteran status, or any other legally protected status.

Required profile

Experience

Level of experience: Entry-level / graduate
Spoken language(s): English