Senior Data Engineer

  • Careforth
  • REMOTE, UNAVAILABLE, US
  • 6mo ago
  • Full-time
  • Remote

Position Summary

This individual will contribute to the evolution of the enterprise DataHub/Delta Lake architecture, driving best practices in cloud-native, Spark-based data engineering. The ideal candidate has hands-on expertise with Databricks, AWS Cloud services, and Apache Spark (Python/Scala), and can serve as both a technical leader and mentor for junior engineers.

What You Will Do:

  • Architect and build robust, scalable, and secure data pipelines leveraging Databricks, Apache Spark, and AWS Cloud (EMR, Redshift, S3, Glue, Lambda).
  • Participate and represent the data team in critical design discussions with technical leads across various product lines.
  • Collaborate closely with application developers, product managers, and business analysts to translate requirements into data models, ETL/ELT workflows, and analytics-ready datasets.
  • Conduct pull request reviews and enforce engineering excellence in code quality, testing, and performance optimization.
  • Troubleshoot and optimize production data workflows while ensuring observability, resilience, and cost-efficiency at scale.
  • Research, evaluate, and apply emerging tools and technologies to continuously modernize the data engineering ecosystem.
  • Act as a mentor to junior engineers, fostering a culture of collaboration, innovation, and continuous learning.

What You Will Bring:

  • Bachelor’s or Master’s degree in Computer Science, Data Engineering, or a related field.
  • 7–10 years of professional data engineering experience, including at least 3+ years with modern cloud-based data lake architectures.
  • Deep expertise in Apache Spark (PySpark, Scala, or Java) for large-scale distributed data processing.
  • Strong experience with Databricks for collaborative data engineering and advanced analytics.
  • Hands-on experience with AWS services including EMR, Redshift, S3, Glue, Lambda, IAM, and related cloud-native data tools.
  • Proficiency in Python (preferred) as well as Java or Scala.
  • Strong understanding of data modeling, data pipelines, and workflow orchestration (Airflow, Step Functions, or similar).
  • Solid foundation in algorithms, data structures, and software engineering best practices.
  • Excellent communication skills and a proven ability to work cross-functionally with product and engineering teams.

You'll Benefit From:

At Careforth your well-being matters. With flexible schedules, a remote-first culture, and a nationally recognized wellness program, our benefits are designed to help you thrive, both professionally and personally. Discover how we invest in you: https://careforth.com/careers/#benefits

 

The pay range for this position is $125K - $185K. The actual wage offered may be lower or higher depending on budget and candidate experience, knowledge, skills, qualifications, and geographic location.

 

#LI-Remote (exceptions NYC, CA, and CO)