Shu-Hao (Ken) Wu
Software Engineer / Site Reliability Engineer
Seasoned Software Engineer with several years of experience in site reliability, software development, and data engineering. Proficient in Terraform, Python, Java, and JavaScript; adept at enhancing software functionality and system reliability. Seeking to leverage expertise in software engineering to drive innovation and efficiency in a new role.
Work Experience
Software Engineer at Ponddy Education Inc.
2020 - 2021 | Taipei, Taiwan
Engineering
- Created a Python-based web scraping tool that autonomously aggregated data from more than 10,000 articles, reducing data aggregation time by approximately 90%
- Employed AWS EC2 instances to run and manage the data pipeline, ensuring high availability and performance
- Migrated database code from PostgreSQL to Django's ORM, optimizing data handling and framework efficiency
- Scheduled and managed tasks with Apache Airflow, ensuring regular, automated data pipeline collection with robust monitoring
- Utilized AWS S3 for efficient and structured data storage, ensuring easy access and management of collected data
Software Engineer at PhysIQ
2022 - 2023 | Chicago, Illinois
Engineering
- Designed and implemented Slack bot using Node.js, reducing onboarding time for new hires by 33%, saving approximately 4 hours per new hire
- Developed web-based capacity calculator using Python Flask, enabling staff to accurately estimate available working hours, resulting in a 20% improvement in resource allocation
- Led the migration of the algorithm pipeline to support real-time health monitoring by making key health data attributes listenable
- Refined project build automation and facilitated cross-platform development by integrating the Python Nox library
- Resolved Kotlin and MySQL RESTful API backend data retrieval issues by debugging on Linux, updating MySQL queries, and refining Kotlin validation methods
- Utilized fixturesets in MySQL to generate test data, validating the new pipeline's output for accuracy and reliability
Data Engineer at Allstate Insurance Company
2023 - 2024 | USA
Data Engineering
- Architected comprehensive data pipeline using Azure Data Factory, reducing data processing time by 50%
- Created transformation pipeline in Azure Data Factory to convert raw data into Parquet format, enhancing data query performance by 30%
- Configured Azure Blob Storage to systematically store raw, processed, and transformed data, providing a clear and organized data management system
- Designed star schema for data storage, optimizing query performance and reducing execution time
- Developed ETL pipeline using Python to load CSV and raw data into PostgreSQL
- Established storage layers for landing, bronze, quarantine, and silver data, improving data organization and quality control
Site Reliability Engineer at THECOO Inc.
2024 - 2025 | Japan
Infrastructure/Site Reliability
- Fixed resource inconsistencies in AWS Lambda by verifying dependencies with services like EventBridge, ensuring all related resources were properly imported into Terraform
- Improved infrastructure reliability by enforcing Infrastructure-as-Code best practices, reducing manual interventions and ensuring a fully Terraform-managed cloud environment
- Isolated production and non-production env in GCP by refactoring Node.js backend and Terraform code, improving data access control, deployment safety, and debugging efficiency
- Refactored Kubernetes environment from MVP by restructuring configurations and optimizing resource allocation, improving deployment stability and reducing setup time
Backend Engineering
- Added indexes to the point payout logs table based on different transaction types, improving query efficiency
- Designed Redis session store in Node.js to support user-ID based session deletion, enabling force logout across devices
- Integrated force-logout mechanism into key API routes for session invalidation on account deletion/password reset
- Migrated MySQL schema with a new operation fee table and foreign keys to existing tables (lottery, etc.)
- Wrote Jest integration tests covering session creation, retrieval, deletion, and force-logout flows
Other Contributions
- Developed a Slack bot in Node.js to fetch and display HR documents in seconds instead of minutes
- Implemented Slack API calls for channel message search based on user queries
- Used Google Drive API to list/filter documents and return filename+link in Slack
Software Engineer at Treasure Data
2026 - Present | Tokyo, Japan
Backend Engineering
- Built automated email reminder workflows for expiring API keys, helping customers avoid unexpected API disruptions
- Designed and implemented MySQL database schema changes and backend logic for configurable API key expiration and reminder policies
- Led the implementation of API key expiration and reminder mechanisms by updating authentication flows and Rails backend endpoints, preventing indefinite API key usage
Frontend Engineering
- Developed an API key management interface in React with configurable expiration settings and notification banners, improving visibility of upcoming API key expirations
Projects
Event Ticketing Service
Event ticketing platform for browsing events, reserving tickets, and managing bookings
Tech: Python, FastAPI, MySQL, SQLAlchemy, Docker
GitHub Receipt Processor Service
Go web service that processes receipts and calculates reward points
Tech: Go, Docker
GitHub MLB Rumors Analysis
Data pipeline for scraping, analyzing, and visualizing MLB trade rumors with sentiment analysis
Tech: Python, Airflow, AWS, Tableau, Pandas
GitHub NewsFlow
Trending news dashboard with keyword search
Tech: Python, Dash
Unit Conversion Service
gRPC-based unit conversion service for converting between measurement units
Tech: Go, gRPC, Protocol Buffers
GitHub Asteroids Game
JavaFX arcade game where players pilot a spacecraft to destroy asteroids
Tech: Java, JavaFX, Maven
GitHub Vocabulary Practice App
JavaFX desktop app for learning vocabulary through word entry and interactive quizzes
Tech: Java, JavaFX
GitHub Hungry Snake
Classic snake game where players eat food and grow while avoiding collisions
Tech: Java
GitHub Hotel Reservation App
CLI-based hotel room reservation system with OOP design and Java Collections
Tech: Java
GitHub PyCleanCodeCLI
CLI tool to clean comments and unnecessary whitespace from Python files
Tech: Python
GitHub Ledger Flux
Privacy-first offline personal finance PWA with expense tracking, investment portfolio monitoring, and multi-currency support
Tech: TypeScript, Next.js, React, Tailwind CSS, IndexedDB, PWA
Skills
Languages
- Python: 100%
- JavaScript: 80%
- Java: 88%
- SQL: 95%
- HTML/CSS: 90%
- Ruby: 80%
- Terraform: 95%
Cloud & Infrastructure
- AWS (EC2, S3, Lambda): 90%
- GCP: 80%
- Azure (ADF, Blob): 75%
- Kubernetes: 80%
- Docker: 85%
Frameworks & Tools
- Node.js: 85%
- Flask / Django: 85%
- MySQL / PostgreSQL: 90%
- Redis: 80%
- Apache Airflow: 75%
- Git / CI-CD: 90%
Education
Master of Science in Information Science, University of Illinois at Urbana-Champaign
Contact
Email: ken.wu.951009@gmail.com
GitHub LinkedIn