Maniphest T368597

Decommission ganeti1019
Closed, Resolved, Public
Assigned To

Authored By

	MoritzMuehlenhoff
	Jun 27 2024, 8:46 AM

Tags

Referenced Files

None

Subscribers

MoritzMuehlenhoff

Details

Other Assignee: Jclark-ctr

	Subject	Repo	Branch	Lines +/-
	Remove ganeti1019 from list of active Ganeti nodes in eqiad	operations/puppet	production	+0 -1

Related Objects

Mentioned Here: T367071: ganeti1019 is down

Event Timeline

MoritzMuehlenhoff created this task. Jun 27 2024, 8:46 AM

Restricted Application added a project: DC-Ops. Jun 27 2024, 8:46 AM

Change #1050263 had a related patch set uploaded (by Muehlenhoff; author: Muehlenhoff):

[operations/puppet@production] Remove ganeti1019 from list of active Ganeti nodes in eqiad

https://gerrit.wikimedia.org/r/1050263

gerritbot added a project: Patch-For-Review. Jun 27 2024, 8:47 AM

Change #1050263 merged by Muehlenhoff:

[operations/puppet@production] Remove ganeti1019 from list of active Ganeti nodes in eqiad

https://gerrit.wikimedia.org/r/1050263

MoritzMuehlenhoff triaged this task as Medium priority. Jun 27 2024, 8:55 AM

MoritzMuehlenhoff updated the task description.

cookbooks.sre.hosts.decommission executed by jmm@cumin2002 for hosts: ganeti1019.eqiad.wmnet

ganeti1019.eqiad.wmnet (FAIL)
- Host not found on Icinga, unable to downtime it
- Found physical host
- Downtimed management interface on Alertmanager
- Unable to connect to the host, wipe of swraid, partition-table and filesystem signatures will not be performed: Cumin execution failed (exit_code=2)
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB

ERROR: some step on some host failed, check the bolded items above
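The summary above refers to bolded items, but a plain-text copy of the output does not preserve bold formatting. As a rough aid, the following minimal sketch separates the step lines that look like failures from those that look successful. The keyword pattern and the `split_steps` helper are assumptions made for illustration only; they are not part of the decommission cookbook and may not match the steps the cookbook itself marked as failed.

```python
import re

# Summary text copied verbatim from the cookbook output above.
SUMMARY = """\
ganeti1019.eqiad.wmnet (FAIL)
- Host not found on Icinga, unable to downtime it
- Found physical host
- Downtimed management interface on Alertmanager
- Unable to connect to the host, wipe of swraid, partition-table and filesystem signatures will not be performed: Cumin execution failed (exit_code=2)
- Powered off
- [Netbox] Set status to Decommissioning, deleted all non-mgmt IPs, updated switch interfaces (disabled, removed vlans, etc)
- Configured the linked switch interface(s)
- Removed from DebMonitor
- Removed from Puppet master and PuppetDB
"""

# Hypothetical keyword heuristic (an assumption, not cookbook behaviour).
FAILURE_HINTS = re.compile(r"\b(unable to|not found|failed)\b", re.IGNORECASE)


def split_steps(summary):
    """Return (host_line, ok_steps, suspect_steps) for one host's summary."""
    lines = [ln.strip() for ln in summary.splitlines() if ln.strip()]
    host_line = lines[0]
    # Step lines start with "- "; drop that prefix.
    steps = [ln[2:] for ln in lines[1:] if ln.startswith("- ")]
    suspect = [s for s in steps if FAILURE_HINTS.search(s)]
    ok = [s for s in steps if not FAILURE_HINTS.search(s)]
    return host_line, ok, suspect


if __name__ == "__main__":
    host, ok, suspect = split_steps(SUMMARY)
    print(host)
    for step in suspect:
        print("SUSPECT:", step)
    for step in ok:
        print("OK:", step)
```

With this heuristic, the two flagged steps are the Icinga downtime and the disk wipe, both consistent with the host already being unreachable (see T367071).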

MoritzMuehlenhoff assigned this task to Jclark-ctr. Jun 27 2024, 9:04 AM

MoritzMuehlenhoff updated the task description.

Maintenance_bot removed a project: Patch-For-Review. Jun 27 2024, 9:30 AM

VRiley-WMF claimed this task. Jun 27 2024, 3:16 PM

VRiley-WMF updated Other Assignee, added: Jclark-ctr.

VRiley-WMF added a subscriber: Jclark-ctr.

Removed the server and ran the decom script.