Title: Technical Lead-Cloud & Infra Engg
Area(s) of responsibility
Server L2 Tasks (Windows + Linux)
These tasks include --
- Disk space issue resolution and remote server troubleshooting
- OS-level diagnostics and patching (Windows/Linux)
- Troubleshoot any server crashes or system errors
- User access and permission management
- Monitoring CPU, memory, and disk health
- Log collection and analysis for incidents
- Virtual server management and scaling
- Support for High availability and disaster recovery configuration
- Configure / maintain print queues on print servers
Coordinate with other teams for cross functional issues
Grafana Admin Tasks
1. User Management: Creating, updating, and deleting user accounts. Assigning roles and permissions to users.
2. Dashboard Management: Creating and managing dashboards, and visualizations.
3. Alerting: Setting up and managing alerts. Ensuring alerts are configured correctly and notifications are sent to the appropriate users.
4. System Monitoring: Monitoring the health and performance of the Grafana instance. Escalate to L2 team of client.
5. Backup and Restore: Performing regular backups of Grafana configurations and data. Restoring data from backups when necessary
6. Documentation: Maintaining documentation for Grafana configurations, dashboards, and processes. Ensuring documentation is up-to-date and accessible to users.
Skills with M/O flag are part of Specialization