









【Capsule】
At FunNow, we’re building joyful experiences, at the speed of now. As a Site Reliability Engineer, you’ll play a crucial role in ensuring our platform stays fast, resilient, and secure for millions of users booking spontaneous fun across Asia. But here’s the twist: we don’t just monitor uptime — we build with AI and automation. From Kubernetes tuning to auto-healing infrastructure, CI/CD pipelines to incident response, you'll be hands-on in evolving our DevOps culture. If you love scalable systems, believe in developer efficiency, and treat infrastructure as code, welcome aboard.
【Typical Accountability】
1. Design robust architectures to comprehensively improve system availability, scalability, and service quality
2. Ensure stable service operation, monitor core service status, and quickly troubleshoot issues
3. Conduct in-depth analysis of system performance bottlenecks and propose and implement improvement solutions
4. Maintain and optimize Kubernetes clusters (EKS/GKE), effectively handling resource pressure, node anomalies, and other situations
5. Maintain and improve CI/CD pipelines and automated deployment systems (GitHub Actions / ArgoCD) to significantly enhance engineering team development efficiency
6. Establish and continuously optimize system monitoring and alerting mechanisms (Prometheus / Grafana / Alertmanager)
7. Assist with incident response and problem investigation
8. Regularly participate in system inspections and audits, proactively proposing and implementing improvements
9. Assist in maintaining and implementing fundamental security settings (e.g., IAM, resource permissions, encrypted storage)
10. Actively share your experience to collectively enhance the team's engineering culture
【Essential Competencies】
1. Familiarity with container technologies such as Docker or Kubernetes, and practical experience with Kubernetes operations (deployment, scheduling, resource management)
2. Familiarity with AWS services (e.g., ECS, EKS, S3, CloudFront, IAM, VPC, etc.), and practical experience maintaining AWS or GCP (we primarily use AWS)
3. Familiarity with at least one CI/CD tool (e.g., GitHub Actions, GitLab CI)
4. Proficiency in MySQL daily management and performance analysis
5. Familiarity with service-related log analysis and monitoring tools (e.g., CloudWatch, ELK/EFK, Grafana), and practical experience with Prometheus/Grafana
6. Experience maintaining Elasticsearch clusters
7. Familiarity with Git and basic Git flow operations
8. High degree of self-management, proactive and responsible work attitude, meticulousness, and excellent communication and teamwork skills
【Desirable Competencies】
1. Exposure to or familiarity with the Golang ecosystem
2. Familiarity with Infra-as-Code tools such as CDK, Terraform
3. Experience with IPO advisory or ISO audit
4. Security awareness
【Who You Are】
1. You enjoy solving real-world problems, are proactive in investigation, and act quickly
2. You value stability and data accuracy, and possess a high sense of responsibility
3. You are passionate about learning new tools and enjoy sharing improvement methods
4. You maintain clear communication and good documentation habits in team collaboration
身為大東南亞地區領先的生活風格預訂平台,也是深受喜愛的品牌,我們正重新定義人們探索與享受休閒生活的方式 — 靠著我們自主研發的 AI 技術與對用戶的深入理解。無論是一場臨時的按摩、一頓即興晚餐,或是一場下班後說走就走的放鬆活動,我們都致力讓每一次體驗更貼心、有趣、懂你。
我們獨創的智慧收益管理系統,協助服務業商家即時優化營運效率,同時為超過 800 萬名用戶,提供多元、精選且高品質的生活體驗 —— 只需動動手指,就能輕鬆預訂理想時刻。
FUNNOW 生態系遍佈七個國家:台灣、新加坡、香港、馬來西亞、泰國、菲律賓與日本,旗下擁有四個活躍品牌:FunNow、Eatigo、Niceday、TABLEAPP。我們串聯超過 10,000 家商家,打造一個「以科技為核心、以快樂為目的」的嶄新生活享樂方式。