IT Engineer Applications – Application Performance Center

Urgent

Apply for this job

Email *
Full Name *
CV Attachment *
Browse

Upload file .pdf, .doc, .docx

Job Description

Candidates should be able to demonstrate working knowledge with the tools/technologies listed on their resumes.  This includes the ability to configure thresholds, create dashboards and alertings on APM (monitoring) tools. They should demonstrate strong analytical and critical thinking skills and be able to troubleshoot and triage issues effectively, understand business and care delivery impact, methodically identify root cause and solutions. The ideal candidate will be a strong communicator and facilitator, able to lead cross-functional meetings with engineers and business people. 

The APC team is responsible for high availability for a portfolio of external consumer facing and internal facing enterprise-wide web and mobile applications.  In order to meet this responsibility, qualified candidates must have experience and skillsets around all aspects of application performance monitoring, proactive analysis and triaging of monitoring data, lead major incident management bridges to resolution, lead RCA analysis to prevent recurrence and close any monitoring gaps and understand changes to production to ensure observability towards monitoring.  The candidate should have technical understanding of the full stack, front end and back end.  They should be able to think critically, methodically and have strong analytical and technical troubleshooting skills, including triaging issues,understand business impact and clearly articulate the incident in working with engineering and development teams.  and The candidate must prioritize incidents and ensuring quick resolution to minimize impact on members and patient care.                                                 

Essential Responsibilities

  • Prioritize incidents and ensuring quick resolution of incidents to minimize impact on members and patient care.  This includes the understanding of SLA’s, mean time to detect, business impact hours and mean time to restore services.
  • Strong analytical and technical troubleshooting skills, including triaging issues, understand business impact and clearly articulate the incident in working with engineering and development teams.
  • Completes work assignments by applying up-to-date knowledge in subject area to meet deadlines; following procedures and policies, and applying data and resources to support projects or initiatives; collaborating with others, often cross-functionally, to solve business problems; supporting the completion of priorities, deadlines, and expectations; communicating progress and information; identifying and recommending ways to address improvement opportunities when possible; and escalating issues or risks as appropriate.
  • Pursues self-development and effective relationships with others by sharing resources, information, and knowledge with coworkers and customers; listening, responding to, and seeking performance feedback; acknowledging strengths and weaknesses; assessing and responding to the needs of others; and adapting to and learning from change, difficulties, and feedback.
  • As part of the IT Engineering job family, this position is responsible for leveraging DEVOPS, and both Waterfall and Agile practices, to design, develop, and deliver resilient, secure, multi-channel, high-volume, high-transaction, on/off- premise, cloud-based solutions.
  • Provides insight into recommendations for technical solutions that meet design and functional needs.
  • Provides systems’ incident support and troubleshooting for basic to moderately complex issues.
  • Assists in identification of specific interfaces, methods, parameters, procedures, and functions, as required, to support technical solutions.
  • Supports collaboration between team members, architects, and/or software consultants to ensure functional specifications are converted into flexible, scalable, and maintainable solution designs.
  • Assists in translating business requirements and functional specifications into code modules and software solutions,with guidance from senior colleagues, by providing insight into recommendations for technical solutions that meet design and functional needs.
  • Assists in the implementation and post-implementation triage and support of business software solutions, with guidance from senior colleagues, by programming and/or configuring enhancements to new or packaged-based systems and applications.
  • Develops and executes unit testing to identify application errors and ensure software solutions meet functional specifications.
  • Assists in the development, configuration, or modification of integrated business and/or enterprise application solutions within various computing environments by designing and coding component-based applications using programming languages.
  • Writes technical specifications and documentation.
  • Assists with efforts to ensure new and existing software solutions are developed with insight into industry best practices, strategies, and architectures.
  • Assists in building partnerships with IT teams and vendors to ensure written code adheres to company architectural standards, design patterns, and technical specifications.
  • Works with vendors (e.g., offshore, application, service).
  • Candidates are required to have 3.5 years’ experience in performance monitoring, event management incident management, problem management, change management and reporting. They will ensure signaling and monitoring is in place for all functions in support of the applications in our portfolio; monitor for all events, and they will handle be responsible for reporting on availability, performance, and incident management.
  • This role requires a strong understanding of software development principles, including debugging, troubleshooting, and optimizing application performance. While the candidate will not be responsible for building or designing applications, they must apply their development knowledge to monitor, analyze, and support enterprise applications effectively. Candidates will be responsible for application performance monitoring (APM) tools, incident management, and cloud-based infrastructure is essential to ensure system reliability and efficiency.                                                          

Minimum Qualifications                                                        

  • Bachelor’s degree in Computer Science, CIS, or related field and Minimum three (3) years experience in software development or a related field.
  • Additional three (3) yearse quivalent work experience for a total of six (6) years may be substituted for the degree requirement.  3 years minimum experience supporting consumer facing large scale enterprise high availability website                               

Essential Requirements                                                         

  • Candidates are responsible for performance monitoring, event management, incident management and reporting. They will ensure signaling and monitoring is in place for all functions in support of the applications in our portfolio; monitor for all events, and they will handle reporting on availability, performance, and incident management.
  • Candidates are required to have experience with APM tools such as Dynatrace, Splunk, or any other similar tools as experienced user configuring alerts thresholds and create dashboards.
  • Candidates must have experience in creating auto recovery scripts and understand technologies for hybrid cloud solutions (AWS or Azure) running in Kubernetes.
  • Experience on troubleshooting Front End (Web Page) issues (Devtools, HTML 5, CSS3, etc.)Candidate preference is to have some experience in creating auto recovery scripts using Shell or Python and understand technologies for hybrid cloud solutions (AWS or Azure) running on Kubernetes/docker.  This includes the understanding of SLA’s, mean time to detect, business impact hours and mean time to restore services.         

Preferred Qualifications                                                                                                        

  • One (1) year of development experience with Amazon WebServices
  • One (1) year of support experience with Amazon Kubernetes Services
  • One (1) year of support experience with Android apps
  • One (1) year of support experience on Microsoft Azure
  • One (1) year of support experience Node.js
  • One (1) year of support experience with iOS apps
  • One (1) year of support experience with Java
  • One (1) year of support experience with JavaScript
  • One (1) year of support experience with Oracle
  • One (1) year of support experience with responsive UI (HTML 5, CSS3, etc.)
  • One (1) year of experience using Splunk
  • One (1) year of experience using Dynatrace
  • One (1) year of experience using BlueTriangle
  • Two (2) years of experience with engineering tools such as bug tracking and source code control systems.
  • Two (2) years of experience working in a large matrixed organization.
  • One (1) year experience working with IT vendors.
  • Two (2) years of experience working with an IT Infrastructure Library (ITIL) framework.
  • Two (2) years of experience writing technical documentation in a software development environment.
  • Two (2) years of experience working with web services.
  • One (1) year of support experience with Adobe Experience Manager
  • One (1) year of development experience with Agile Methodology
  • One (1) year of experience creating complex incident analysis and availability reports                                                          

Benefits

  • Transportation.
  • Life insurance.
  • Medical insurance.
  • Solidarity association.
  • Growth plans.
  • Additional days off.

K3