Aggie Innovation Platform Site Reliability Engineer

Location: College Station, Texas

Job Title

Aggie Innovation Platform Site Reliability Engineer

Agency

Texas A&M University

Department

Division of Information Technology

Proposed Minimum Salary

Commensurate

Job Location

College Station, Texas

Job Type

Staff

Job Description

Our Commitment

Texas A&M University is committed to enriching the learning and working environment for all visitors, students, faculty, and staff by promoting a culture that embraces inclusion, diversity, equity, and accountability. Diverse perspectives, talents, and identities are vital to accomplishing our mission and living our core values.

Who we are

The Division of Information Technology provides reliable and accessible IT services to elevate and enhance Texas A&M University. We provide IT leadership to the campus community while enabling the research, education, and service mission of Texas A&M. With trusted services and innovative solutions, we are changing the technology landscape on campus. To learn more about IT at Texas A&M University, visit us at: https://it.tamu.edu/

What we want

The Senior IT Professional II (Site Reliability Engineer II) is responsible for providing technical leadership for identity management projects or services. Provides technical oversight for the application of and compliance with technical standards. May coordinate the technical activities of a support team. Completes reports and summaries for management and/or users including project status reports, problem reports, and progress summaries.

Required Education and Experience:

  • Bachelor’s degree in applicable field or equivalent combination of education and experience
  • Eight years of experience in multiple technology areas such as system administration, DevOps, collaborative software development, customer support, application support, project management, database administration, system reporting, access management, system security, and/or disaster recovery

Required Knowledge, Skills, and Abilities:

  • Must be able to work in a collaborative team environment. 
  • Ability to multi-task and work cooperatively with a diverse range of people. 
  • Must have strong interpersonal skills. 

Preferred Education and Experience:

  • Bachelor of Science degree
  • Programming experience with at least two of the following languages: Node.js, Python, Ruby, Go, or Bash. 
  • Knowledge of and experience using databases, particularly MySQL. 
  • Knowledge of and experience with data analysis. 
  • Knowledge of and experience writing REST web services.
  • Knowledge of and experience consuming cloud web services (Azure and Google APIs in particular).
  • Knowledge of and experience with PowerShell.
  • Knowledge of and experience with Docker, containers, and related technologies.
  • Knowledge of and experience with Kubernetes on-premises and in one or more public clouds (AWS, GCP, Azure).
  • Experience with at least one of the following automation technologies: Chef, Ansible, or Puppet.
  • Experience, including actual pull requests, with GitHub or GitLab.
  • Knowledge of and experience with CI/CD methodologies. 
  • Knowledge of and experience with Microsoft, Linux, and Mac operating systems (Windows Server 2012 & 2016, Windows 10, CentOS, Mac OS X). 
  • Knowledge of and experience with Microsoft Active Directory and OpenLDAP.
  • General familiarity with network protocols and theory (TCP/IP, UDP, ICMP, MAC addresses, IP packets, DNS, OSI layers, load balancing, etc.).
  • General familiarity with principles of project management and service management frameworks (e.g., ITIL/ITSM).
  • Knowledge of and experience with DevOps methodologies.

Preferred Knowledge, Skills, and Abilities:

  • Advanced cross-disciplinary IT skills, advanced analysis and troubleshooting/problem-solving skills, client relations skills, requirement assessment and analysis, project management methodology, an understanding of context and interrelationships, and proficiency in ITIL.
  • Experience with Objectives and Key Results methodologies is highly desirable.

Preferred Licenses and Certifications:

Responsibilities:

  • Scripting - Maintains, develops, and documents scripts to maintain infrastructure services.
  • Server Administration - Provides technical guidance and oversight for server administration. Sets up and configures large and complex servers. Develops complex system logic and configuration. Conducts complex server performance analyses and tuning. Coordinates routine audits of systems and software.
  • Problem Management - Oversees and coordinates the analysis of system logs. Coordinates and monitors the problem management process to include backup support. Troubleshoots complex network problems. Provides Tier III support.
  • Data Security - Oversees the maintenance of system security and the protection and recovery of client data. Develops disaster recovery plans for complex systems.
  • Documentation - Oversees the process used to document server support methods, procedures, and configuration.
  • New Technology Planning, Evaluation, Deployment, and System Integration Testing - Coordinates the evaluation of new technologies. Makes recommendations based on the evaluation of new technologies for their applicability to the client’s needs. Creates, evaluates, and approves plans for the implementation of new technology deployments and system integration testing.
  • Project Planning Support - Collaborates with the project leader to develop work plans and time schedules for projects, including outlining phases, identifying personnel, and computing equipment requirements.
  • Common - May coordinate the technical activities of a project team. Completes reports and summaries for management and/or users including status reports, problem reports, progress summaries, and system utilization reports. Serves as a senior member of an information resource team responsible for setting technical direction. Performs some of the duties of a Site Reliability Engineer I. Performs other duties as assigned.
  • Professional Development - Participates in training and professional development sessions. 

In accordance with the federal contractor vaccination mandate, specific facilities at The Texas A&M System may be considered a covered contractor workplace with covered contractor employees. Therefore, successful applicants for this position may be subject to the federal mandate and will be required to be fully vaccinated against COVID-19 as a condition of employment unless an approved medical or religious accommodation is in place.

All positions are security-sensitive. Applicants are subject to a criminal history investigation, and employment is contingent upon the institution’s verification of credentials and/or other information required by the institution’s procedures, including the completion of the criminal history check.

Equal Opportunity/Affirmative Action/Veterans/Disability Employer committed to diversity.


