Due to an influx of spam, we have had to temporarily disable account registrations. Please write an email to accountsupport@archlinux.org, with your desired username, if you want to get access. Sorry for the inconvenience.
The error message is from introduced in ebdb20361059 x86/cpu: Provide cpu_init/parse_topology(). Is the issue still present in the upstream git tree? Is the message correct that there is a firmware bug? What is the connection with lm_sesnsors?
I have no idea how and whether this is related, but both have only been occurring since 6.9.1. Sensors from the mainboard and the graphics card are working correctly, only those from the cpu are not working.
Thanks for reporting this issue! This could possibly be a kernel regression, which should be bisected and reported upstream to the kernel developers and the regression mailing list
Are you confident to do the bisection on your own or do you need some help?
If you want we could also provide you with prebuilt kernel images for you to test
Does linux-6.8rc4.r2 produce the \[Firmware Bug\]: CPU0: Topology domain 0 shift 1 != 4message? If not it would seem both issues are introduced by later commits.
Linux FIRE 6.8.0-rc4-1-00001-g43d86e3cd9a7
Likewise. The download went absolutely fine. There were no messages like:
"[Firmware Bug]: CPU0: Topology domain 0 shift 1 != 4"
@gromit to me that confirms the error message has been triggering ever since it was introduced in ebdb20361059 x86/cpu: Provide cpu_init/parse_topology(). If it is considered a legitimate firmware error that the kernel should be warning about then perhaps ask the kernel developers to rate cap the message? Outstanding is if there is any connection with sensors.
With this kernel, from all the previous spam, only one line remained in the log:
archlinux kernel: [Firmware Bug]: CPU4: Topology domain 1 shift 7 != 6
For some reason, it was CPU4, the rest of the processor cores disappeared from the log
Build failure with the patch applied as arch/x86/xen/enlighten_pv.c uses the get_cpu_cap which the patch makes into a local static. I am trying building now with that part of the patch reversed.
On this kernel (linux-6.9.2.arch1-1.1-x86_64.pkg.tar.zst) the situation has almost improved. Except for one line in the log: archlinux kernel: [Firmware Bug]: CPU4: Topology domain 1 shift 7 != 6
@gromit could you forward the compilation failure to the mailing list thread if you can reproduce it? Perhaps also mention the failure with the adjusted patch?
@igorog please post the full output from dmesg for 6.9.1-arch1-2 and 6.9.2.arch1-1.5.
I seems like upstream thinks something might not be right with the output from kcpuid
Could you install it from the repos and re-send its output @igorog?
I have packaged it in the meantime
$ sudo pacman -Syu kcpuid# make sure that its the right kcpuid and i.e. not something in /usr/local/$ type -a kcpuid kcpuid is /sbin/kcpuidkcpuid is /bin/kcpuidkcpuid is /usr/bin/kcpuidkcpuid is /usr/sbin/kcpuid$ kcpuid -r
https://www.etallen.com/cpuid.html is the cpuid used by Fedora and Debian it does produce per cpu output for cpuid -r so I suspect this is the variant upstream wants.
I don't quite understand, if this is a firmware error, why errors in the boot log appeared only on the new 6.9 kernels. After all, on previous kernels such dmesg-spam was not observed in the logs.
PS: unfortunately, I can’t update the UEFI-BIOS firmware, since they simply aren’t available for my china laptop. ))
I don't quite understand, if this is a firmware error, why errors in the boot log appeared only on the new 6.9 kernels. After all, on previous kernels such dmesg-spam was not observed in the logs.
"Yes this is because the applied patch makes it print the firmware only once, the behaviour itself is unchanged" )))
Do you think it is possible to correct the spam situation on my configuration? Or will you have to live with this forever? Okay, this can still be overcome, but I’m afraid that the situation will get worse in further implementations of the Linux kernel. What if the system suddenly stops booting completely? ))
PS: and one more thing: it seems to me that (purely subjective feelings) that on 6.9.. the system has become a little slower.
Do you think it is possible to correct the spam situation on my configuration?
You would need to ask upstream. My understanding is the error message is printed once for each core where the topology does not match which on your system is all the E cores.