Observability & Site Reliability Engineer (SRE) Job at Artmac Soft LLC, Fort Worth, TX

  • Artmac Soft LLC
  • Fort Worth, TX

Job Description

Who we are:

Artmac Soft is a technology consulting and service-oriented IT company dedicated to providing innovative technology solutions and services to customers.

Job Title: Observability & Site Reliability Engineer (SRE)

Job Type: W2

Experience: 5-15 years

Location: Fort Worth, Texas

Responsibilities:

  • Experience with Dynatrace, AppMon, Zabbix, SCOM, Datadog, CloudWatch, X-Ray, and Splunk.
  • Self-motivated and able to work in a 24x7 environment.
  • Experience managing critical system outages and interacting at all organizational levels.
  • On-call support availability.
  • Proficiency in monitoring and alerting tools (e.g., Dynatrace, Datadog, CloudWatch, Splunk).
  • Strong understanding of IT infrastructure, including servers, networks, databases, and cloud environments.
  • Some experience with incident, problem, and change management processes is a plus.
  • Ability to analyze complex systems and identify performance bottlenecks.
  • Excellent troubleshooting and problem-solving skills.
  • Effective communication and collaboration skills.
  • Familiarity with ITIL best practices and service management frameworks.
  • Operate in a 7-day/24-hour environment with after-hours support flexibility.
  • Collaborate with internal teams and suppliers to lead event resolution across all mission-critical IT and telecom service levels.
  • Protect business system availability through integrated incident, problem, and change management.
  • Monitor systems for faults and optimization opportunities.
  • Assist the major incident response team and escalate critical events.
  • Evaluate and improve monitoring/alerting tools and processes.
  • Conduct technical root cause analysis and engage with management teams for internal issues.
  • Identify potential business-impacting events and manage incident processes.
  • Provide expert guidance during reviews and debriefs.
  • Analyze problem trends and monitoring tool data to identify chronic issues.
  • Communicate effectively with senior management.
  • System Monitoring: Implement and maintain monitoring solutions to track the performance, health, and availability of IT systems, applications, and networks.
  • Alert Management: Configure and manage alerting mechanisms to ensure timely notifications of any anomalies, failures, or performance degradations.
  • Incident Response: Collaborate with support and operations teams to analyze and resolve events, and lead the resolution process during incidents and outages.
  • Root Cause Analysis: Conduct thorough investigations to determine the root cause of incidents and implement corrective actions to prevent recurrence.
  • Optimization: Identify opportunities for system optimization and performance improvements through data analysis and trend identification.
  • Tool Evaluation and Integration: Evaluate, recommend, and integrate new monitoring and alerting tools and technologies to enhance the organization's monitoring capabilities.
  • Documentation and Reporting: Develop and maintain comprehensive documentation, including monitoring configurations, incident reports, and performance metrics.
  • Collaboration and Communication: Work closely with various IT teams, including application, infrastructure, and DevOps teams, to ensure seamless operations and effective communication during incidents.

Qualifications:

  • Bachelor's degree or equivalent combination of education and experience.
