Senior Data Engineer

Location: San Francisco, CA

Overview

TruSTAR is a cyber intelligence platform built to help security and fraud teams make better decisions, faster. This position offers a unique opportunity for talented data engineers to build mission-critical data pipelines that help connect a network of enterprise security operators facing some of the most sophisticated adversaries.

Working on the engineering team at TruSTAR is a unique mix of working at a fast-moving early-stage startup and an enterprise security company. The enterprise market has historically been saturated with monolithic, poorly designed software solutions that rarely change. At TruSTAR we take the opposite approach: we use the most modern tools to help our clients solve some of their most complicated security-related data challenges.

About TruSTAR's Engineering Team

Our team works in a microservices-oriented architecture using modern technologies such as Docker, GitLab CI/CD, Kafka, gRPC, and Spark, among others. We pride ourselves on using an “Infrastructure-As-Code” approach that ensures our code and infrastructure are secure and easily deployed in the cloud. Our engineers are empowered to try out new technologies and contribute to streamlining our stack while they evolve the platform to the next level.

Your Roles and Responsibilities 

  • Work on business-critical data pipelines that increase the security of our Fortune 500 clients by operationalizing data ingested from their tools and premium intelligence feeds.
  • Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of sources and tools using SQL, Spark, Kafka, and AWS.
  • Design, architect, and support new and existing data and ETL pipelines, and recommend improvements and modifications with a focus on maintainability, scalability, and performance.
  • Analyze, debug, and correct issues with data pipelines.
  • Work closely with Data Scientists to make the training, testing, and deployment of Machine Learning models seamless.
  • Communicate strategies and processes around data modeling and architecture to cross-functional groups and senior-level management.
  • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc.

Minimum Qualifications 

  • At least 3 years of experience building complex ETL pipelines using Spark.
  • Proficiency in Scala / Java.
  • Experience writing complex SQL and ETL processes.
  • BS in Computer Science / Software Engineering or equivalent experience.
  • Strong knowledge of Apache Spark, Spark Streaming, Apache Kafka, and similar technology stacks.
  • Strong understanding and practical use of algorithms and data structures.

Bonus Points

  • Background working with cybersecurity data.
  • Experience working with Python. 
  • Experience with cloud service providers such as AWS / Databricks.

How To Apply

Send your resume or CV to jobs@trustar.co with the subject line “Senior Data Engineer” to apply today.

TruSTAR embraces diversity. We are proud to be an equal opportunity workplace and do not discriminate on the basis of sex, race, color, age, sexual orientation, gender identity, religion, national origin, citizenship, marital status, veteran status, or disability status.