Proficiency in Python programming., Familiarity with web technologies like HTTP, JSON, HTML, XPath, or JavaScript., Experience with version control systems such as Git., Eagerness to learn new skills and improve existing ones..
Key responsibilities:
Develop and maintain receipt parsing configurations using regexes and XPath.
Troubleshoot and optimize the receipt extraction platform.
Create and analyze new technologies to enhance scalability.
Collaborate with team members through code reviews and pair programming.
Report this Job
Help us maintain the quality of our job listings. If you find any issues
with this job post, please let us know. Select the reason you're reporting
this job:
Numerator is a data and tech company bringing speed and scale to market research. Numerator blends first-party data from over 1 million US households with advanced technology to provide unparalleled 360-degree consumer understanding for the market research industry that has been slow to change. Headquartered in Chicago, IL, Numerator has more than 2,000 employees worldwide. The majority of Fortune 100 companies are Numerator clients.
We’re reinventing the market research industry. Let’s reinvent it together.
At Numerator, we believe tomorrow’s success starts with today’s market intelligence. We empower the world’s leading brands and retailers with unmatched insights into consumer behavior and the influencers that drive it.
Numerator is currently looking for a Configuration Engineer to join our Data Extraction (DX) team in India (Remote). In this role, you will be responsible for helping to grow and maintain a library of thousands of receipt parsing configurations used by fortune 500 brands and retailers. Day-to-day, you’ll come up with creative solutions to complex problems, and learn new skills to complement your existing abilities.
This is a great role for those who are looking for hands-on experience with high visibility and impact. We welcome fresh ideas and approaches as we constantly aim to improve our development processes. Our team has experience using a wide range of technologies and years of cloud and big data experience. We are always learning and growing, so we can guarantee that you won’t be bored with us! If you are seeking an environment where you get to do meaningful work with other great engineers, then we want to hear from you!
What You Will Get To Do
Write clean, efficient, thoroughly tested code, back-up with pair programming and code reviews. Much of our code is Python, but we use all kinds of languages and frameworks.
Create complex regexes that pull structured data out of OCR-transcribed receipt images as well as XPATHs to extract data from receipt emails.
Maintain the platform that drives our receipt extraction at scale.
Troubleshoot, test, and maintain the platform and configurations to ensure strong optimization and functionality.
Evaluate the technical tradeoffs of decisions and build things that last and scale.
Maintain and fix existing configuration issues.
Create and analyze new configuration technologies - figuring out how we can scale up our receipt extraction.
What You'll Bring to Numerator
Programming experience in Python
An eagerness to learn new things, and improve upon existing skills, abilities and practices
Familiarity with web technology, such as HTTP, JSON, HTML, XPath or JavaScript.
Knowledge in an Agile software development environment,
Experience with version control systems (Git, Subversion, etc.).
Have a real passion for clean code and finding elegant solutions to problems.
Eager to expand your knowledge and abilities in python and cloud-based technologies.
Motivation to participate in ongoing learning and growth through pair programming, test-driven development, code reviews, and application of new technologies and best practices.
You look ahead to identify opportunities and foster a culture of innovation.
Good communication (verbal and written)
Nice to haves
Knowledge of web scraping
Knowledge of regular expressions
Knowledge of business rules engines.
Familiarity with virtual software development environments (ie. Vagrant, docker etc.)
Familiarity with object-oriented programming
Scripting knowledge
Familiarity with JSON and similar data formats
Experience with databases, SQL or noSQL.
Programming experience on Unix based infrastructure.
Knowledge of cloud-based systems (EC2, Rackspace, etc.).
Expertise with big data, analytics, machine learning, and personalization.
Required profile
Experience
Spoken language(s):
English
Check out the description to know which languages are mandatory.