Kelvin Luna

SRE / DevOps
San Salvador

Summary

Highly skilled IT professional with over 7 years of experience in roles such as Cloud Architect/Engineer, System Administrator, and DevOps/SRE, including Backend and Frontend Development. Accomplished engineer with extensive expertise in cloud monitoring, deployment, and troubleshooting. Proven track record in defining, building, and maintaining infrastructure using both vendor-neutral and platform-specific tools. Organized and focused individual with exceptional leadership skills and a commitment to continuous improvement.

Overview

8 years of professional experience
5 Certifications

Work History

DevOps Engineer

BairesDev
03.2024 - Current
  • Environment Management: Spearheaded management of multiple Workato environments (Dev, Stage, and Prod) using Terraform, ensuring each environment had dedicated on-prem agents and secure token management.
    Achievement: Reduced deployment errors by 30% through consistent and repeatable infrastructure configurations.
  • AWS Infrastructure Configuration: Architected and implemented VPCs, subnets, and security groups for AWS Lambda functions using Terraform, enhancing infrastructure scalability and security.
    Achievement: Improved system scalability by 40% and reduced security vulnerabilities by implementing best practices in network segmentation and access control.
  • Monitoring and Metrics Integration: Developed and integrated comprehensive CloudWatch metrics for AWS RDS Aurora using Terraform, enabling detailed performance monitoring and proactive issue resolution.
    Achievement: Enhanced system reliability with a 25% reduction in downtime through proactive monitoring and alerting mechanisms.
  • Advanced Log Management with Terraform: Designed and configured AWS S3 buckets for efficient streaming of Cloudflare logs to Rapid7 InsightIDR, utilizing Cloudflare's Logpush module for seamless log management.
    Achievement: Improved log accessibility and analysis speed by 50%, facilitating quicker incident response and forensic investigations.

Site Reliability Engineer

P2P
01.2023 - 01.2024
  • Implemented an automated health check system for Kubernetes clusters that reduced mean time to repair from 2-3 hours to 5 minutes
  • Enabled certain services to restart automatically with 1 minute of downtime and no engineer intervention
  • Developed tools to automate creation of Kubernetes clusters using Terraform and integrated them into RW's CI/CD pipeline
  • Created Grafana and Prometheus dashboards to monitor cluster resource usage and performance metrics, which software engineers use to troubleshoot problems with their applications
  • Documented procedures for deploying Kubernetes clusters via Helm charts and best practices for deploying applications onto Kubernetes, which 20+ SREs at other companies use
  • Developed and maintained infrastructure as code (IaC) using Ansible and Terraform to automate deployment and scaling of applications in the cloud
  • Managed and optimized containerized workloads using Kubernetes, including deploying and configuring services, creating and managing pods, and setting up monitoring and logging
  • Developed custom scripts in Python and Bash to automate routine tasks and streamline processes, such as automated backups and monitoring dashboards
  • Led a team of junior engineers, providing mentorship and guidance on SRE and DevOps best practices, and participated in the hiring process.

DevOps / SRE

Praxent
06.2022 - 01.2023
  • Worked on projects for a variety of fintech companies, designing, optimizing, and implementing their infrastructure
  • Worked closely with front-end and back-end developers to build the infrastructure their applications needed
  • Worked on scripts for CI/CD pipelines, content delivery networks, database snapshot migration, and application load balancers
  • Created dashboards and alerts in Grafana to keep systems healthy and monitor metrics for different applications
  • Built application infrastructure using Terraform on cloud providers such as AWS and Azure
  • Built and maintained Docker container clusters managed by Kubernetes.

Site Reliability Engineer

Avianca Holding
01.2022 - 07.2022
  • Managed Docker orchestration and containerization using Kubernetes and deployed applications in AWS EKS clusters
  • Built and maintained fully automated CI/CD pipelines to build infrastructure in AWS using Terraform and CloudFormation
  • Used multiple AWS services such as VPC, EC2, ELB, ASG, S3, IAM, RDS, Route 53, and SNS
  • Worked on disaster recovery in AWS by moving all the services cross-account and cross-region
  • Managed logging infrastructure and alert management tools such as Kibana, Dynatrace, Prometheus, Grafana, and OpenTelemetry
  • Used CloudWatch and Dynatrace to set up monitoring of AWS resources such as EC2, CPU, memory, and EBS volumes, and set alarms for notifications or automated actions
  • Built Jenkins jobs with ArgoCD using Groovy scripts for CI/CD pipeline builds, and was actively involved in pipeline setup and Jenkins configuration
  • Deployed Kubernetes applications with Helm charts, and created Kubernetes ConfigMaps, Ingresses, and Services
  • Worked on tasks such as S3 cross replication, moving RDS snapshots from one account to another, DynamoDB replication, and DR migration
  • Part of a 24/7 on-call rotation with other team members, debugging and fixing issues
  • Optimized and reduced the cost of AWS services by setting alerts and creating dashboards for each account, deleting EBS volumes, applying autoscaling, and introducing tools like Zesty to the organization
  • Maintained and improved site performance and reliability as part of SRE
  • Actively involved in production releases, monitoring, rolling back, and debugging until the environment was completely up and running
  • Proficient in scripting languages like Python, integrating with AWS Lambda serverless frameworks
  • Paid attention to detail while completing assignments.

VoIP Network Engineer and Cloud Engineer

Brightlink
03.2020 - 01.2022
  • Established robust infrastructure and data capacity for new applications
  • Provided network support services for devices such as hubs, bridges, routers, and other hardware
  • Troubleshot complex multi-vendor network service provider issues
  • Troubleshot Juniper and Cisco equipment and performed packet analysis
  • Managed, tracked, and coordinated problem resolution and escalation processes
  • Monitored network capacity and performance to diagnose and resolve complex network problems.

System Engineer

Serenova
06.2018 - 03.2020
  • Analyzed user experience and user-centered design (UI frameworks) to design user interfaces that conform to web standards
  • Defined enterprise processes and best practices and tailored enterprise processes for applications
  • Developed WSDL to describe web services
  • Implemented SOAP and RESTful web services to communicate with various applications
  • Used SOAP for messaging and interacting with web services
  • Played an active role in the team by working with business analysts to convert business requirements into system requirements
  • Involved in conducting Joint Application Development sessions and collected the end-user requirements.

NOC Engineer

Covestic
06.2016 - 06.2018
  • Actively involved in analyzing system requirements specifications and in client interaction during requirements gathering
  • Set up and maintained applications on Amazon Web Services (AWS EC2)
  • Worked with relational databases (MySQL) and non-relational databases (Cassandra, MongoDB)
  • Involved in database migrations using Active Record, and used Action Controller, Active Resource, fixtures, and Action View in Rails
  • Launched VMs on different cloud platforms and monitored their performance and configuration
  • Followed agile development methodology and Scrum for the project
  • Performed unit testing, integration testing, and GUI and web application testing using RSpec.

Education

Master of Science - DevOps Master

Universitat Politecnica De Catalunya
09.2024

Bachelor of Science - Information Technology

Universidad Don Bosco

Skills

Cloud Platforms:

  • AWS, Azure, GCP

Containerization and Orchestration:

  • Kubernetes, Docker

Infrastructure as Code:

  • Terraform, CloudFormation

Monitoring and Logging:

  • Kibana, Grafana, Prometheus, OpenTelemetry

Secrets Management:

  • Vault

Programming Languages:

  • Python, Golang, Node.js

Operating Systems:

  • Linux

Databases:

  • MySQL, Aurora

Networking:

  • VoIP, Wireshark

Certification

AWS Certified Solutions Architect