Data Engineer – Python, SQL, PySpark – ON-SITE

The application deadline for this job has passed.
Acceler8 Talent
  • California
  • Post Date: July 2, 2025
  • 72383
Job Overview

About Sourcemap

Sourcemap is a pioneer of supply chain transparency and traceability software that spun out of MIT research started in 2008. Since then major traders, manufacturers, and brands have adopted Sourcemap’s full-suite solution for assurance on the raw materials-to-finished goods supply chain, including ongoing monitoring for production, quality, sustainability, and risks such as deforestation and forced labor.

About The Role

Company Overview

Sourcemap is the leading provider of supply chain mapping, traceability, and transparency software. We are the only full suite supply chain transparency and traceability solution on the market. Our clients include category-leading global brands, manufacturers and suppliers across the food & agriculture, fashion, beauty, manufacturing and electronics industries. We turn these clients into best-in-class responsible sourcing organizations. We seek committed individuals who will join our team to support our award-winning, values-led work and to tackle important supply chain challenges in a dynamic startup environment.

About the Job

Sourcemap is seeking an experienced Data Engineer to join their growing engineering team. You would be joining an enthusiastic and collaborative team of engineers in a fully remote position. This role has a strong hands-on component as you would be building and deploying new features to their platform. You would also take part in the development and mentorship of junior team members by sharing best practices and prior experiences.

What You’ll Do

  • Development and maintenance of ETL and ELT pipelines
  • Data warehouse/data lake development and management
  • API development
  • Assembling large, complex sets of data that meet functional and non-functional business requirements
  • Optimizing data delivery
  • Automating manual data processing procedures
  • Working with stakeholders, including data scientists, design, and product teams, and assisting them with data-related technical issues
  • Excellent analytic skills associated with working with unstructured datasets

What You’ll Bring

  • 5+ years writing production-ready data pipelines (Python, SQL, Scala, etc.)
  • 3+ years with relational SQL and NoSQL databases
  • 2+ years of experience using geospatial tools, data, and techniques (ESRI, QGIS, GDAL, GeoPandas, handling multiple geometry types, coordinate transformations, etc.)
  • A successful history of manipulating, processing, and extracting value from large disconnected datasets
  • Architecture experience (microservices, AWS management, etc.)
  • Experience using big data tools such as Databricks, Snowflake, Spark, etc.
  • Project management
  • Other experience: Node, JavaScript, Go, Git, debugging
  • Comfortable with data visualization and web mapping concepts

Other Skills & Qualifications:

  • Effective listening, verbal and written communication skills. Demonstrates openness to others’ ideas.
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.
  • Ability to adapt fast and be agile in a fast-moving environment
  • Enjoys working in a collaborative environment
  • Goal-oriented and a self-starter
  • Ability to multitask, prioritize, and manage time effectively in order to meet demanding deadlines
  • A natural leader; enjoys mentoring and guiding junior developers
  • Enthusiastic and positive team player
