IT Server Operations Analyst
Houston, TX (100% on-site)
Duration: Initial 6-month contract (W2, not eligible for C2C)
Pay: $51-56/hr
We are seeking a highly motivated and detail-oriented IT Server Operations Analyst to join our Network Operations Center (NOC) team. This role is ideal for an IT professional who excels in a fast-paced operational environment and possesses a strong foundation in server administration, infrastructure monitoring, incident response, and operational support.
Key Responsibilities
Infrastructure Monitoring & Incident Management
- Monitor infrastructure, applications, and platform health using tools such as SolarWinds Orion, Dynatrace, AWS CloudWatch, Azure Monitor, and related monitoring solutions.
- Respond to alerts and incidents promptly, perform triage, escalate issues appropriately, and drive resolution within established SLA targets.
- Document incidents, troubleshooting activities, resolutions, and escalation paths to support root cause analysis and knowledge sharing.
- Participate in an on-call rotation to provide after-hours operational support.
- Execute maintenance window activities, including validation testing, maintenance mode verification, and alert suppression management.
Server Operations & Infrastructure Support
- Perform troubleshooting, diagnostics, and remediation for Windows and Linux servers across physical, virtual, and hybrid environments.
- Support routine server maintenance, health checks, patching activities, firmware updates, and post-maintenance validation.
- Manage and coordinate support tickets, ensuring timely communication and resolution across technical teams.
- Maintain accurate operational documentation, system configurations, procedures, and support standards.
- Support disaster recovery processes, including failover and failback activities, validation, and documentation.
- Administer and maintain DNS records using Infoblox to support infrastructure changes, application updates, and business continuity activities.
- Use AWS CloudWatch and Azure Monitor for infrastructure visibility, alerting, reporting, and operational monitoring.
- Partner with observability teams to improve monitoring coverage, service visibility, and application-aware troubleshooting capabilities.
Automation & Operational Excellence
- Contribute to automation initiatives that improve operational consistency, reduce manual effort, and enhance service reliability.
- Develop or maintain scripts and workflows using PowerShell, Bash, Python, Ansible, or similar automation technologies.
- Identify repetitive operational tasks and recommend automation, runbook creation, or workflow improvements.
- Support enhancements to monitoring onboarding, maintenance procedures, ticket routing processes, and operational documentation.
Communication & Reporting
- Provide clear and effective communication during shift handoffs, incident escalations, operational reviews, and stakeholder updates.
- Collaborate with infrastructure, application, cloud, and support teams to ensure alignment on service priorities and issue resolution.
- Assist in developing operational dashboards, reporting metrics, and actionable alerting strategies.
- Track and analyze operational performance metrics, recurring incidents, risks, and opportunities for continuous improvement.
Required Qualifications
- 5+ years of experience in server operations, infrastructure support, systems administration, or NOC environments.
- Strong experience supporting Windows Server and Linux operating systems.
- Hands-on experience with infrastructure monitoring, alert management, and incident response processes.
- Familiarity with monitoring and observability platforms such as SolarWinds Orion, Dynatrace, or similar tools.
- Working knowledge of AWS and/or Microsoft Azure, including CloudWatch and Azure Monitor.
- Experience supporting data center operations, hardware troubleshooting, and physical/virtual infrastructure environments.
- Strong understanding of networking fundamentals, including TCP/IP, DNS, and DHCP.
- Excellent analytical, troubleshooting, documentation, and communication skills.
- Ability to work rotating shifts, participate in on-call support, and operate effectively within a NOC environment.
Preferred Qualifications
- Experience supporting hybrid infrastructure environments spanning on-premises, AWS, and Azure platforms.
- Familiarity with enterprise observability and monitoring strategies utilizing CloudWatch, Azure Monitor, and related tools.
- Experience with automation and scripting using PowerShell, Bash, Python, Ansible, or comparable technologies.
- Knowledge of virtualization platforms such as VMware vSphere or Microsoft Hyper-V.
- Experience with ITSM platforms and ticketing systems, including Helix Remedy or similar solutions.
- Hands-on experience with Infoblox or enterprise DNS administration.
- Experience developing runbooks, playbooks, operational procedures, or automation workflows.
- Industry certifications (Microsoft, Red Hat, AWS, Azure, CompTIA Server+) or equivalent credentials.
...