Key Facts

Remote From:

Anywhere

Freelance

German, English

Hard Skills

Other Skills

•
Teamwork
•
Communication

PradeepIT Consulting Services Pvt Ltd

About PradeepIT Consulting Services Pvt Ltd

Join PradeepIT: Innovating Information Technology and Solutions We pride ourselves on delivering exceptional results in the digital era of Information and Technology. With a track record of successful projects across offshore, nearshore, and onshore locations, we've built a clientele spanning 3 continents and 100+ companies. Our 380+ successful projects reflect our commitment to excellence. At PradeepIT, we prioritize the satisfaction of our resources, followed closely by meeting the needs of our clients and partner companies. Our service offerings include Outsourcing, Managed Services, Recruitment Process Outsourcing, Customer Relationship Management, Web Designing & Development, Content Development, and E-commerce Management. We specialize in SAP S/4 HANA, SAP Customer Data Cloud (Gigya), SAP CPQ (Calliduscloud), SAP Customer Experience, Mobility, Android, iOS & Hybrid development for native application developments. Our core value of "Join As Employee With Us And Grow Up As Entrepreneur" drives our mission. We proudly introduce our venture companies: 1. HG Infotech Pvt Ltd 2. Beli Brother Consulting Pvt Ltd 3. Involvemind 4. Stackglobe 5. Prastuti Consulting 6. Khahara Consulting 7. Thinkhigh And many more to come in the near future. We invite you to be part of our clients' success stories by adopting the latest technology and continually improving processes. If you'd like to learn more about us or have specific questions, visit www.pradeepit.com, call us at 08047363377, or email info@in.pradeepit.com. Let's work together on your next project!

Founded: 2018

Company size: 51 - 200

Website LinkedIn See all jobs →

Job description

Job description:

NLP Engineer / Machine Learning Engineer Document Understanding & Knowledge Graphs

Overview Were looking for a hands-on NLP/ML engineer to lead the development of an intelligent document understanding pipeline for extracting structured data from complex, unstructured RFQ documents (40100+ pages, in German and English).
You will be responsible for building scalable systems that combine document parsing, layout analysis, entity extraction, and knowledge graph construction ultimately feeding downstream (e.g. Analytics and LLM applications.)
Key Responsibilities - - - - - -
Design and implement document hierarchy and section segmentation pipelines using layout-aware models (e.g., DocLayout-YOLO, LayoutLM, Donut).
Build multilingual entity recognition and relation extraction systems across both English and German texts.
Use tools like NLTK, transformers, and spaCy to develop custom tokenization, parsing, and information extraction logic.
Construct and maintain knowledge graphs representing semantic relationships between extracted elements using graph data structures and graph databases (e.g. Neo4j) Integrate outputs into structured LLM-friendly formats (e.g., JSON, Mark Down) for downstream extraction of building material elements.
Collaborate with product and domain experts to align on information schema, ontology, and validation methods. What Were Looking For - - - -
Strong experience in NLP, document understanding, and information extraction from unstructured/multilingual documents.
Proficiency in Python, with experience using libraries such as transformers, spaCy, and NLTK. Hands-on experience with layout-aware models like DocLayout-YOLO, LayoutLM, Donut, or similar.
Familiarity with knowledge graphs and graph databases such as Neo4j, RDF