DevOps/SRE Specialist for Scalable QR Platform

Upwork

Remote

1 day ago

About

We're building a scalable QR rewards and verification platform. As the part-time DevOps/SRE Specialist, you will support infrastructure scaling and reliability for our app, focusing on automation for high-traffic operations, designing cost-optimized AWS infrastructure, managing Apache Solr for search capabilities, and optimizing PostgreSQL on RDS.

Key Responsibilities

- Design and implement cost-optimized AWS infrastructure from scratch, including S3 buckets, CloudFront CDN, and auto-scaling strategies (e.g., Kubernetes/ECS) for databases and servers
- Set up and manage PostgreSQL on AWS RDS with read replicas, including performance tuning and optimization
- Install, configure, and manage Apache Solr cluster on AWS, including schema design and query optimization
- Implement AWS Application Load Balancer for high availability
- Configure Redis/ElastiCache for caching
- Implement monitoring and alerting (e.g., CloudWatch, Prometheus/Datadog, custom dashboards) for performance issues
- Automate CI/CD pipelines (e.g., GitHub Actions, CodePipeline) for deployments
- Conduct security audits and configurations (e.g., WAF, security groups, IAM roles)
- Implement backup and disaster recovery procedures
- Optimize cloud resources and costs (e.g., using Reserved Instances, Spot instances, rightsizing) for high-volume operations like 25M codes/month
- Docker containerization and orchestration (e.g., ECS/Fargate)
- Provide advisory support, handle incidents, and perform resilience testing as needed

Must-Have Skills & Experience

- Bachelor's in IT, Computer Science, or related field
- 4-5+ years of DevOps/Infrastructure/SRE engineering experience
- Expert-level AWS experience (e.g., ECS, RDS, VPC, ALB, CloudWatch), with familiarity in other cloud platforms (e.g., GCP)
- Expert-level Apache Solr installation, configuration, and management
- Expert-level PostgreSQL administration on AWS RDS, including performance tuning
- Strong experience with AWS cost optimization strategies, including Reserved/Spot instances and rightsizing
- Experience with AWS Application Load Balancer, auto-scaling, and high-availability setups
- Proficiency in Infrastructure as Code (e.g., Terraform or CloudFormation)
- Proficiency in Docker containerization and orchestration (e.g., ECS/Fargate or EKS)
- CI/CD pipeline implementation experience (e.g., GitHub Actions, CodePipeline)
- Strong Linux administration skills
- Database backup and recovery strategies
- Network security and VPC configuration
- Monitoring and logging setup (e.g., CloudWatch, Prometheus/Datadog, ELK stack)
- Proficiency in scripting (e.g., Bash/Python) and SRE tools
- Experience with high-scale systems (1M+ users)
- Ability to work independently on flexible schedules
- Proven track record of managing AWS infrastructure at scale
- Portfolio of Apache Solr deployments you've managed
- Experience with PostgreSQL replication and failover
- Understanding of high-availability architectures
- Knowledge of database connection pooling and query optimization
- Experience with SSL/TLS certificate management
- Security best practices (encryption at rest/transit, secrets management)

Specific Apache Solr Requirements

- Design Solr schema for product catalog search
- Implement Solr indexing for millions of products
- Configure Solr for high-performance queries
- Set up Solr replication for high availability
- Optimize Solr cache settings
- Implement Solr monitoring and health checks

Specific PostgreSQL Requirements

- Design RDS configuration for optimal performance
- Set up read replicas for load distribution
- Implement automated backup strategies
- Configure connection pooling
- Performance monitoring and slow query analysis
- Database parameter tuning for workload

How to Apply

Please submit:

1. Your resume/CV
2. AWS architecture diagrams from past projects
3. Brief write-up of Apache Solr deployments you've managed (scale, challenges, solutions)
4. Examples of cost optimization you've achieved
5. Brief cover letter explaining your experience with:
   - AWS infrastructure design
   - Apache Solr at scale
   - PostgreSQL RDS management
   - Cost optimization strategies
6. Expected compensation range