Job Description
Job Title: Data Architect
Location: Remote first, with an office in Austin. In person with the team 2-4 times per year
Team: Engineering
Reports To: CTO
About Service Direct
Service Direct, in business since 2006, is a leader in the Pay Per Call Lead Generation landscape. We pride ourselves on providing high-quality inbound Leads to our Clients. In a world where Lead Generation can have a bad rap, we win by being ethical, radically transparent, and by treating everyone with respect. We have built a high-performing Lead Marketplace that connects Publishers and Lead Generators to Local Small Businesses. Every day, thousands of consumers find the companies and services they need through Service Direct’s real-time auction system.
We are a technology-driven company that relies heavily on the software we build to scale our services and empower our Teammates and Clients. Our mission is to provide excellent service through informed, empowered people backed by strong data and powerful systems.
We are a small team doing outsized work. We do this by supporting each other and taking responsibility for our own output. We work together because it’s challenging, fun, and rewarding to build solid software that our teams and clients truly appreciate and use every day to grow their organizations.
The Role
As Service Direct’s first data management hire, you will lead our efforts to clarify and polish our data to enable outstanding client experiences and excellent business outcomes. We have been building our data systems and pipelines for a while, and we face both legacy challenges and a strong push to develop new and deeper insights from our data. You will impact and interface with every department in our company and work closely with our data-thirsty management team, who have years of observational experience and want to prove hypotheses and implement solutions with a better understanding of the data.
You will be responsible for our data architecture and governance, ensuring that reliable, consistent, and timely data is available to all our consumers, end to end.
Your work will begin by working alongside the team members who are currently on data projects. As you learn what is going on, you’ll take responsibility for guiding, teaching, and mentoring the team while also contributing individually and building excellent data solutions. You will refine (or define) processes and documentation. You will advocate for better tooling and systems, then implement those solutions and drive the migration to, and adoption of, best data practices.
We think this work will break down to roughly 40% Individual Contributor / 60% Teach, Manage, Mentor.
The Tech
Our data exists in several systems, and we have a few iterations of data pipelines still in operation. Your goal will be to build or rebuild an enterprise-worthy data system that takes us to the next level of data trustworthiness and data inspiration. Our applications consume ~5M inbound requests and create ~20M new database records per day. The tech we currently use includes:
• Databases: MySQL/MariaDB, Datastream, BigQuery
• Backend: PHP, Node.js; running on Debian and Google App Engine
• Caching: Memcached, Redis
• Frontend: React (over Vite), Native JS
• Machine Learning / AI: VertexAI, Scikit-learn, Python, Jupyter
• Infrastructure: Bare metal servers, Google Cloud Platform, bash, cron, apache2, nginx
• Data / Business Intelligence: Dataform, Looker Studio
Our data processing includes:
• PHP and SQL scripts that run on bare metal servers and upload transformed data into BigQuery
• BigQuery Scheduled Queries that transform data into other tables
• Data replication from hosted hardware into BigQuery via Datastream (Google’s CDC service)
• Redis queues capturing high frequency data and shipping into BigQuery in batches
• Our newest ETL efforts in Google Dataform, where additional SQL transforms data into final tables for reporting and some application use
• New Pub/Sub infrastructure powering new Webhooks to deliver data to Clients
About You
As a seasoned data veteran, you understand that data truly does power innovation, and your desire to make a big impact already has you asking questions about what we are doing and how we can improve. You are a great listener, and your organizational brain and skills help you thrive at making someone’s data dreams a reality. You enjoy flexing your visualization skills, which are built on your excellence in architecting and orchestrating great data solutions.
Your curiosity, drive, and thoughtful approach make you an excellent addition to our lean team: you are both a get-it-done person and a coach to your teammates as you steer us toward better data solutions and processes.
Your Qualifications
• Deep understanding of Data Governance best practices and the desire and ability to lead excellent data processes across diverse teams.
• Strong experience in SQL and working with cloud-based data warehouses (preferably BigQuery).
• Strong experience working with application development and guiding solutions toward excellent data.
• Proven ability to design, manage, and optimize data transformation pipelines using tools such as Dataform.
• Excellent at discovering, validating, and visualizing truths in data.
• Demonstrated experience in building and maintaining reports/dashboards with Looker Studio (or similar BI tools).
• Excellent communication skills to work effectively with technical and non-technical stakeholders.
• A proactive and self-motivated mindset, guided by intuition to pursue excellence both within the data and across the organization.
Expected Outcomes & Impact
Trustworthy, Actionable & Insightful Data for Teammates and Clients
• Excellent data changes a company in so many ways. From CRM Automations to Bidding Algorithm Models, data that is studied and implemented well will dramatically improve our business.
Our Data Systems scale well with our growing business and operate smoothly with well-implemented Data Governance principles
• With optimal data architecture, we achieve high reliability at the most efficient cost while maintaining enterprise security.
• Our BI teams know how to access and use trusted data sources, and our Dev teams know how to enhance and debug existing data architectures and pipelines.
Job Type: Full-time
Pay: $170,000.00 - $205,000.00 per year
Benefits:
• 401(k)
• Dental insurance
• Flexible schedule
• Health insurance
• Paid time off
• Vision insurance
Application Question(s):
• Please answer these questions with your application:
1. Flip the script here and ask us the questions that are on your mind after reading this job description. What do you want to know more about? What ideas have you immediately thought through? What do you suspect we are missing?
2. Why do you like working with data?
Work Location: Hybrid remote in Austin, TX 78745