SRE (Site Reliability Engineer)

eToro

Software Engineering
Bnei Brak, Israel
Posted on Monday, May 13, 2024

  • R&D
  • Bnei Brak, Israel
  • Full-time

Description

eToro is seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our dynamic team. As an SRE, you will ensure that our infrastructure and applications are reliable, scalable, and performant. You will collaborate closely with cross-functional teams to design, build, and maintain resilient systems that meet the needs of our customers and business stakeholders.

Responsibilities:

  1. Collaborate with R&D engineers on the coordination, communication, and execution of production-related operations.
  2. Design, implement, and maintain scalable and reliable infrastructure solutions to support our applications and services.
  3. Develop and deploy monitoring, alerting, and logging systems to proactively identify and mitigate operational issues.
  4. Build an SRE dashboard with KPIs that measure the reliability of eToro’s applications.
  5. Conduct capacity planning and performance tuning to optimize system performance and resource utilization for improved user experience.
  6. Automate repetitive tasks and processes to streamline operations and improve efficiency.
  7. Participate in incident response and resolution, including root cause analysis and post-mortem reviews.
  8. Continuously evaluate and adopt new technologies and methodologies to enhance our infrastructure and operations.
  9. Create and maintain documentation, runbooks, and knowledge base articles covering system configurations, procedures, and best practices, and share that knowledge across teams.

Requirements

  • 4+ years of experience as a DevOps/SRE/Integration engineer, with a passion for technology and strong motivation to build highly reliable solutions.
  • In-depth knowledge of observability tools (Prometheus, Splunk, Datadog, Grafana).
  • Experience with Git, Jenkins, GitHub Actions (preferred), virtualization, containers, and Kubernetes.
  • Cloud providers: AWS / Azure (preferred) / GCP.
  • Excellent understanding of Linux operating systems and scripting languages (Python, Bash).
  • Strong communication skills, both verbal and written, with the ability to adapt the messaging to different perspectives (technical, business) and levels of detail.
  • Ability to grasp new technologies quickly and to prioritize across multiple concurrent responsibilities.
  • Excellent problem-solving skills and the ability to work effectively in a fast-paced, dynamic environment.
  • Experience with Ansible, Terraform - an advantage.