5+ years of industry experience in backend development., Fluency in Python and strong command of SQL., Bachelor's or Master's degree in Computer Science., Experience with building data infrastructure and developing APIs..
Key responsibilities:
Develop high-performance data processing systems.
Build and deploy scalable, near real-time data infrastructure.
Incorporate customer feedback into product development.
Ensure high-quality, robust, and efficient code.
Report this Job
Help us maintain the quality of our job listings. If you find any issues
with this job post, please let us know. Select the reason you're reporting
this job:
Scribble Data is a machine learning and generative AI company, helping enterprises build AI-powered data products for advanced analytics. Our flagship product, the Enrich Intelligence platform, enables data preparation and lightweight ML algorithm execution. Our newly launched Hasper engine sits atop Enrich to make it a full-stack LLM data products platform. Backed by Blume Ventures, we enable organizations to unlock the true potential of data with secure hosting options that ensure trust, reliability, and data integrity.
To learn more, visit our website at https://www.scribbledata.io/
Scribble Data is a generative AI (GenAI), machine learning (ML), and advanced analytics data products platform company. It focuses on the Annuities and Pension Risk Transfer (PRT) segments of the Insurance industry in the US, Canada, and UK.
Hasper Scribble Datas flagship product, is designed to implement and transform internalfacing complex workflows to streamline insurance operations, enable actuarial work, and accelerate decisionmaking. It enables businesses to utilize large language models (LLMs) to bridge the gap between decision workflows and natural language, combining ML with a builtin rules engine. It provides a fullstack solution for data preparation and lightweight ML algorithm execution, coupled with a lowcode consumption interface designed for business users.
For added security and trust, Hasper offers various secure hosting options for customers, ranging from onpremise to virtual private cloud.
About the Product
Hasper is a fullstack solution in Python that enables customers to build internalfacing data products and workflows using LLMs and MLRS (ML at Reasonable Scale) robustly and quickly. It trades scale of data for speed of development and deployment, strong auditability, and lower skill. With the tradeoffs, Hasper is able to deliver 5x speedup over existing development processes and tools, and enables endtoend productionisation of use cases with a remarkably high success rate.
About the Role
Scribble Data customers trust them with their data and the outputs and expect cuttingedge tools and thought processes. The product is at the center of the relationship with customers. We work smart, invest in tools and tech that are on the cutting edge of advanced analytics, and package all that we’ve learnt about efficiency and data mileage into our products.
As a Lead Engineer, you will work on extending functionality for Hasper. Your role will also include delivering working systems atop Hasper for Scribble’s customers with high attentiontodetail ensuring trust, robustness, performance, and ease of use. Your responsibilities will include:
Develop highperformance data processing systems.
Develop and deploy robust, distributed, scalable, near realtime, data processing infrastructure that can support various downstream applications including reporting, analytics, user intelligence, machine learning, and decision support.
Incorporate customer feedback and new asks from customers into ongoing deliverables.
Accept nothing less than topquality code and think in terms of code reviews, unit, integration, endtoend tests in a fastmoving development environment, delivering incremental value daily.
Track and research emerging trends as well as competitors, thereby creating feedback loops for the product team.
Requirements
5+ years industry experience in backend development
Fluency in Python; you know the language and its quirks, not having just written scripts.
Strong command of SQL.
Strong understanding of computer science concepts (OOD, algorithm design, data structures, algorithms, execution, and memory optimization).
Strong software engineering skills, systemlevel thinking, and problemsolving ability paired with the love to build and ship robust code.
Experience developing and publishing APIs using web services.
Strong communication and collaboration skills to work with stakeholders from different backgrounds.
BachelorsMasters in Computer Science.
Skills that will give you an edge
Having built a product at a startup.
Experience in building data infrastructure systems for data workflow management.
Experience in fullstack development.
Experience working in dataheavy applications
Fluency with Pandas
Experience using at least some among distributed storage (e.g. S3), serverless architecture (e.g. Lambda), streaming data (e.g. Kafka), and scalable search (e.g. Elasticsearch).
Experience with big data technologies like mapreduce, NoSQL, Spark, HBase, and Hive.
Experience with data security protocols and data access controls.
Product Technology Stack
Python and SQL
Python data science libraries (pandas, numpy, scikitlearn, spacy, etc.)
Deep learning frameworks like Tensorflow, Keras, PyTorch, etc
R, Matlab
We want you to know
Primarily remote work
A work culture that helps you innovate and evolve continuously
A chance to shape the future of ML, marrying it with generative AI
Financial services domain that is career defining
Does this sound like you?
You like adventure and have a stomach for learning and unlearning every day
You are excited by powering up a whole generation of upcoming data applications
You dream big and believe that hard problems need streamlined systems and massive amounts of data to tackle effectively
You embrace uncertainty and can develop processes and systems to handle rapidly evolving requirements
You have the discipline to take audacious goals and break down yearslong roadmaps into near term deliverables that provide value to business stakeholders
You enjoy educating your customers and team members about what it means to be dataoriented and cultivating engineering best practices
You are comfortable selling to customers (not a biggie)
You have a sense of humour
What to expect when you apply
Introductory round with leadership
Take Home assignment taking 35 hours
Technical Interviews: 3 4 rounds
Offer discussion
Interested?
Send us your resume at hello@scribbledata.io
Know of someone else who you think should see this? Spread the word. Tell that friend who is perfect for this role.
Required profile
Experience
Level of experience:Senior (5-10 years)
Spoken language(s):
English
Check out the description to know which languages are mandatory.