New nvme ssd in the home server #hardware
It's been more than 5 years since I last upgraded my home server.
Today the kernel barfed out a lot of loglines which seemed to pertain to the five years and 11 weeks old nvme ssd it is running on:
pcieport 0000:00:01.3: AER: Corrected error message received from 0000:06:00.0
nvme 0000:06:00.0: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID)
nvme 0000:06:00.0: device [8086:f1a8] error status/mask=00000001/0000e000
nvme 0000:06:00.0: [ 0] RxErr (First)
Although these were all "Corrected", and although smart-log reads out:
available_spare : 100%
available_spare_threshold : 10%
percentage_used : 2%
it does also say:
power_on_hours : 45,474
and as the disk usage current is at:
Filesystem Size Used Avail Use% Mounted on
/dev/nvme0n1p1 1.9T 1.3T 608G 68% /
I am taking the opportunity to upgrade the old 2TB Intel 660p NVMe SSD to a new 4TB Samsung 990 Pro NVMe SSD. This should also give a decent boost in performance, from 1800 Mbps read/write and 220K IOPS to 7450/6900 Mbps read/write and 1400/1550K IOPS.
Currently I have the new NVMe SSD connected via a Dezen USB-thing, and I am running rsync -varSHx --progress --stats --exclude /mnt / /mnt/ to transfer everything. When it has run through, I will boot the machine in single-user mode, run the rsync again, and then switch over after adjusting the fstab and running grub-install.
Ok, that went a little less smoothly than expected.
The new SSD is 4TB, so a DOS partition table was out. I created a GPT table, not realizing that then grub needs a BIOS boot partition.
So the server was down for like 20 minutes, then up again while I managed to create the needed partition, and then down again for 40 minutes while I made the actual switch.
The commands I used were:
to transfer all the files - once while running, then once while booted in single user mode.
Then I did:
And then I shut down and swapped in the new SSD in the server.
This sort of worked, except grub was still trying the old UUID, so I booted with
root=/dev/nvme0n1p1 singleand ranupdate-grub.Then I reassembled the machine and put it back on the hat rack, where it belongs.
Only thing I had to fix so far after that was the
/usr/bin/pingbinary:as I run it as a non-privileged user in various simple monitoring checks.
- Adam Sjøgren 🕘︎ - 2024-07-15