User Details
- User Since
- Aug 22 2023, 3:06 PM (59 w, 4 d)
- Availability
- Available
- LDAP User
- ValerieRiley
- MediaWiki User
- VRiley-WMF [ Global Accounts ]
Thu, Oct 10
Wed, Oct 9
Thanks! I'll resolve this for now. Feel free to reopen if the issue crops up again.
Tue, Oct 8
Hey @andrea.denisse We can continue to try to troubleshoot this error. Currently, it isn't showing any hardware fault through the iDRAC. However, I know we spoke about possibility powering the unit down as well. Is there any preferrance on how we should proceed?
Thu, Oct 3
@Jclark-ctr You're right. I had a typo. It is in Rack D6 as per the request
Wed, Oct 2
Location:
D5
U31
CableID 2576
Port 30
Tue, Oct 1
@MoritzMuehlenhoff Once the server can be powered off, we will insert the RAM and when it powers back up, it should instantly recogize it. I'm ready anytime today to install the memory. Just let us know when puppetserver1001 can be powered down. Thanks!
Mon, Sep 30
Surprisingly, I have been able to locate six (6) 32 gig sticks of RAM 3200 MHz. Please let us know when we can initiate this process.
Hi @MoritzMuehlenhoff It looks like we could use snapshot1008 and snapshot1009 as stand ins for the servers. Let us know if there is any prefernce on cage or location.
Thu, Sep 26
after troubleshooting this, we had to reboot E1 managment switch. This issue should be cleared up.
After troubleshooting the cables and seeing multiple issues with other servers. It was recommended to reboot the switch. Logged it and then proceeded to reboot. It looks like this has cleard up the issue. Closing this now.
This drive has been replaced. Please let us know if there are any further issues.
Reseated cable and it seems to be communicating now. Will close this and monitor.
Mon, Sep 23
Hi! We do have a spare DIMM that we can swap at anytime for this unit. Please let us know when is the best time to proceed with this. Thanks!
Swapped out cable. Closing for now.
Hey @ABran-WMF as it turns out, we don't happen to have any 2TB to use as a replacment. However, we do have plenty of 4TB drives that should work. Is it okay to move forward with swapping it out with a 4TB drive?
After working with Dell and explaining the issue, they can confirm that there is no hardware issues in the TSR report. I did provide them the image that @Jclark-ctr provided as well. Case#: 198075128 They are continuing to believe there is something with the OS.
With this information, I'm going to reach back out to Dell.
Fri, Sep 20
Thu, Sep 19
This DIMM (B2) has been swapped out. Please let us know if any other issue crops up.
Is there an acceptable time to swap out the DIMM? We can proceed at any time.
Atempted to rebalance power.
Wed, Sep 18
Sep 12 2024
@Eevans the drives that were not listed in the group have been replaced. Please let us know if anything else is needed.
@andrea.denisse This drive has been replaced Please let us know if there are any other issues with this unit.
Sep 11 2024
After working with Dell on this issue for a while and they reviewed the logs, they don't see any issues with the Hardware. Would it be possible to reinstall the OS and we'll monitor this issue to see if anything else comes up? Also, logging in through iDrac, it doesn't show any errors at the moment.
Sep 10 2024
I have attempted a few troubleshooting steps. I have uploaded logs to Dell under SR 197398410. Awaiting results.
@ABran-WMF I'm taking a look at this. I will update with results.
ganeti1039
B2
U4
CableID 4893
Port 3
Sep 9 2024
Thank you! I appreciate it. Will be relabeling the new cable as 0325. Feel free to reach out if anything else happens.
Sep 6 2024
Ah, the blinking light did activate. I have swapped the HDD, and it should be good to go. Let us know if there is anything else we can help with. Thank you!
Sep 5 2024
Hi @BTullis we can replace this drive at any time. Although the LED on the drive isn't on, as long as we know the slot, that works for us.
This drive has been replaced! Thanks!
Sep 4 2024
Sep 3 2024
Sure, that will work for us. We will plan for it then. Thank you!
This is completed. Thank you!
Hey @MoritzMuehlenhoff , thanks for reaching out on this ticket. Thankfully, I have been able to locate a replacement disk for this unit. We can swap this disk at anytime.
Rebalanced power
Aug 29 2024
Ports: 27/28 patch to cr2-eqiad:xe-3/0/3 - Cable ID 1-8292024
Rebalanced power
Aug 27 2024
logging-sd1001
Rack E 5
U 32
CableID 20220092
Port 18
Drive has been replaced. Please let us know if there are any other issues with this drive. Thanks!
Aug 26 2024
ml-serve1009
Rack A2
U19
CableID 4897
Port 7
Aug 24 2024
Aug 23 2024
Was able to find the drive. We can replace at anytime @MatthewVernon
Aug 21 2024
Calling back into dell for this ticket. It was supposed to have 1 day shipping, however has not yet arrived.
Aug 20 2024
Sounds like a plan. Thank you! I will be at the ready.
@ayounsi I've checked the device and there doesn't seem to be any failure notifications (Physically anyway). Would it be possible to open up a RMA or Support ticket with Juniper?
As of right now, since there are no replacements. I will be closing this ticket. If a replacement is needed, feel free to open this back up or make a new ticket and we can look into what options we may have.
Aug 19 2024
plugged the port in and also reseated management cable
Worked with Dell on this, they will be shipping out a new HDD which will arrive tomorrow. (Dell ticket 196124764)
Worked with Dell on 1298. They have determined it will require a motherboard swap. Parts will be coming tomorrow. (Dell ticket 196127967)
I have attempted to rebalance this power again. I'm hopeful that this should help with the errors.