* [gentoo-user] my 5.15.93 kernel keeps rebooting @ 2023-02-14 14:08 John Covici 2023-02-14 19:08 ` Rich Freeman 0 siblings, 1 reply; 19+ messages in thread From: John Covici @ 2023-02-14 14:08 UTC (permalink / raw To: gentoo-user Hi. So, foolish me, I decided to go from a working 5.10.155 system to try latest lts of 5.15 which is 5.15.93. Compile, install went well, but the system keeps rebooting. It gets all the way and even starts the local services and then here are the last few lines, which may be relevant or not: Feb 14 06:36:31 ccs.covici.com systemd[1]: Starting local.service... Feb 14 06:36:31 ccs.covici.com systemd[1]: Starting systemd-update-utmp-runlevel.service... Feb 14 06:36:31 ccs.covici.com bash[5753]: rm: cannot remove '/etc/ppp/provider_is_up': No such file or directory Feb 14 06:36:31 ccs.covici.com systemd[1]: systemd-update-utmp-runlevel.service: Deactivated successfully. Feb 14 06:36:31 ccs.covici.com systemd[1]: Finished systemd-update-utmp-runlevel.service. -- Boot 5c394be675854680a9cb616208f374f3 -- Any trouble shooting suggestions as to what is making the system reboot? Thanks in advance. -- Your life is like a penny. You're going to lose it. The question is: How do you spend it? John Covici wb2una covici@ccs.covici.com ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] my 5.15.93 kernel keeps rebooting 2023-02-14 14:08 [gentoo-user] my 5.15.93 kernel keeps rebooting John Covici @ 2023-02-14 19:08 ` Rich Freeman 2023-02-14 19:54 ` John Covici 2023-02-15 14:50 ` [gentoo-user] " Grant Edwards 0 siblings, 2 replies; 19+ messages in thread From: Rich Freeman @ 2023-02-14 19:08 UTC (permalink / raw To: gentoo-user On Tue, Feb 14, 2023 at 9:08 AM John Covici <covici@ccs.covici.com> wrote: > > Hi. So, foolish me, I decided to go from a working 5.10.155 system to > try latest lts of 5.15 which is 5.15.93. Compile, install went well, > but the system keeps rebooting. It gets all the way and even starts > the local services and then here are the last few lines, which may be > relevant or not: > > Feb 14 06:36:31 ccs.covici.com systemd[1]: Starting local.service... > Feb 14 06:36:31 ccs.covici.com systemd[1]: Starting > systemd-update-utmp-runlevel.service... > Feb 14 06:36:31 ccs.covici.com bash[5753]: rm: cannot remove > '/etc/ppp/provider_is_up': No such file or directory > Feb 14 06:36:31 ccs.covici.com systemd[1]: > systemd-update-utmp-runlevel.service: Deactivated successfully. > Feb 14 06:36:31 ccs.covici.com systemd[1]: Finished > systemd-update-utmp-runlevel.service. > -- Boot 5c394be675854680a9cb616208f374f3 -- > > Any trouble shooting suggestions as to what is making the system > reboot? > Where are you getting this from, the system log/journal? This doesn't seem like a clean shutdown, so if it is a kernel PANIC I wouldn't expect the most critical info to be in the log (since it will stop syncing to protect the filesystem). The details you need probably will be displayed on the console briefly. You can also enable a network console, which will send the dmesg output continuously over UDP to another device. This won't be interrupted by a PANIC unless there is some issue with the hardware or networking stack. If you can get the final messages on dmesg and the panic core dump that would help. The other thing you can do is try to capture a kernel core dump, but that is a bit more complicated to set up. Otherwise your log is just going to say that everything was fine until it wasn't. -- Rich ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] my 5.15.93 kernel keeps rebooting 2023-02-14 19:08 ` Rich Freeman @ 2023-02-14 19:54 ` John Covici 2023-02-14 21:25 ` Rich Freeman 2023-02-15 14:50 ` [gentoo-user] " Grant Edwards 1 sibling, 1 reply; 19+ messages in thread From: John Covici @ 2023-02-14 19:54 UTC (permalink / raw To: gentoo-user On Tue, 14 Feb 2023 14:08:34 -0500, Rich Freeman wrote: > > On Tue, Feb 14, 2023 at 9:08 AM John Covici <covici@ccs.covici.com> wrote: > > > > Hi. So, foolish me, I decided to go from a working 5.10.155 system to > > try latest lts of 5.15 which is 5.15.93. Compile, install went well, > > but the system keeps rebooting. It gets all the way and even starts > > the local services and then here are the last few lines, which may be > > relevant or not: > > > > Feb 14 06:36:31 ccs.covici.com systemd[1]: Starting local.service... > > Feb 14 06:36:31 ccs.covici.com systemd[1]: Starting > > systemd-update-utmp-runlevel.service... > > Feb 14 06:36:31 ccs.covici.com bash[5753]: rm: cannot remove > > '/etc/ppp/provider_is_up': No such file or directory > > Feb 14 06:36:31 ccs.covici.com systemd[1]: > > systemd-update-utmp-runlevel.service: Deactivated successfully. > > Feb 14 06:36:31 ccs.covici.com systemd[1]: Finished > > systemd-update-utmp-runlevel.service. > > -- Boot 5c394be675854680a9cb616208f374f3 -- > > > > Any trouble shooting suggestions as to what is making the system > > reboot? > > > > Where are you getting this from, the system log/journal? This doesn't > seem like a clean shutdown, so if it is a kernel PANIC I wouldn't > expect the most critical info to be in the log (since it will stop > syncing to protect the filesystem). The details you need probably > will be displayed on the console briefly. You can also enable a > network console, which will send the dmesg output continuously over > UDP to another device. This won't be interrupted by a PANIC unless > there is some issue with the hardware or networking stack. > > If you can get the final messages on dmesg and the panic core dump > that would help. > > The other thing you can do is try to capture a kernel core dump, but > that is a bit more complicated to set up. > > Otherwise your log is just going to say that everything was fine until > it wasn't. > Thanks a lot for responding. OK, how would I set up logging to a network and what would I have to do on another computer -- which in my case is Windows? I do have a terminal program on there called teraterm which can do ssh, but that is about what I have -- unless there is some other program I can put on there. -- Your life is like a penny. You're going to lose it. The question is: How do you spend it? John Covici wb2una covici@ccs.covici.com ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] my 5.15.93 kernel keeps rebooting 2023-02-14 19:54 ` John Covici @ 2023-02-14 21:25 ` Rich Freeman 2023-02-14 22:04 ` John Covici 0 siblings, 1 reply; 19+ messages in thread From: Rich Freeman @ 2023-02-14 21:25 UTC (permalink / raw To: gentoo-user On Tue, Feb 14, 2023 at 2:54 PM John Covici <covici@ccs.covici.com> wrote: > > On Tue, 14 Feb 2023 14:08:34 -0500, > Rich Freeman wrote: >> > > will be displayed on the console briefly. You can also enable a > > network console, which will send the dmesg output continuously over > > UDP to another device. > > OK, how would I set up logging to a network and what would I have to > do on another computer -- which in my case is Windows? The docs are at: https://www.kernel.org/doc/Documentation/networking/netconsole.txt (you can also google for linux netconsole for some wiki articles on it) I have on my command line: netconsole=@/,6666@10.1.0.52 That IP is the host I want the log traffic to go to. (Read the docs if you have a more complicated networking setup - I assume that will just run ARP and send stuff out without using a gateway/etc.) Then on a receiving linux host I'd run (I think - it has been a while): nc -u -l -p 6666 Now, you mentioned Windows. I've never used it, but nmap has a program available in a windows version called ncat that might do the job: https://nmap.org/ncat/ You just want to make sure you have it listening on port 6666 for UDP. Make sure you use UDP or you won't receive anything. If it is working you should get a ton of log spam when your host boots - anything that shows up in dmesg will show up in the network console. It is sent in realtime. -- Rich ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] my 5.15.93 kernel keeps rebooting 2023-02-14 21:25 ` Rich Freeman @ 2023-02-14 22:04 ` John Covici 0 siblings, 0 replies; 19+ messages in thread From: John Covici @ 2023-02-14 22:04 UTC (permalink / raw To: gentoo-user On Tue, 14 Feb 2023 16:25:55 -0500, Rich Freeman wrote: > > On Tue, Feb 14, 2023 at 2:54 PM John Covici <covici@ccs.covici.com> wrote: > > > > On Tue, 14 Feb 2023 14:08:34 -0500, > > Rich Freeman wrote: > >> > > > will be displayed on the console briefly. You can also enable a > > > network console, which will send the dmesg output continuously over > > > UDP to another device. > > > > OK, how would I set up logging to a network and what would I have to > > do on another computer -- which in my case is Windows? > > The docs are at: > https://www.kernel.org/doc/Documentation/networking/netconsole.txt > > (you can also google for linux netconsole for some wiki articles on it) > > I have on my command line: netconsole=@/,6666@10.1.0.52 > > That IP is the host I want the log traffic to go to. (Read the docs > if you have a more complicated networking setup - I assume that will > just run ARP and send stuff out without using a gateway/etc.) > > Then on a receiving linux host I'd run (I think - it has been a while): > nc -u -l -p 6666 > > Now, you mentioned Windows. I've never used it, but nmap has a > program available in a windows version called ncat that might do the > job: https://nmap.org/ncat/ > > You just want to make sure you have it listening on port 6666 for UDP. > Make sure you use UDP or you won't receive anything. > > If it is working you should get a ton of log spam when your host boots > - anything that shows up in dmesg will show up in the network console. > It is sent in realtime. Sounds great -- I notice you ommitted the ip address, my network device is brought up by a systemd unit file, will I need to specify the device, then? I was thinking of netconsole=@192.168.0.1/eno1 -- would this be correct, assuming the ip address is correct? -- Your life is like a penny. You're going to lose it. The question is: How do you spend it? John Covici wb2una covici@ccs.covici.com ^ permalink raw reply [flat|nested] 19+ messages in thread
* [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-14 19:08 ` Rich Freeman 2023-02-14 19:54 ` John Covici @ 2023-02-15 14:50 ` Grant Edwards 2023-02-15 15:19 ` John Covici ` (2 more replies) 1 sibling, 3 replies; 19+ messages in thread From: Grant Edwards @ 2023-02-15 14:50 UTC (permalink / raw To: gentoo-user On 2023-02-14, Rich Freeman <rich0@gentoo.org> wrote: > Where are you getting this from, the system log/journal? This doesn't > seem like a clean shutdown, so if it is a kernel PANIC I wouldn't > expect the most critical info to be in the log (since it will stop > syncing to protect the filesystem). The details you need probably > will be displayed on the console briefly. You can also enable a > network console, which will send the dmesg output continuously over > UDP to another device. This won't be interrupted by a PANIC unless > there is some issue with the hardware or networking stack. If you've got a serial port[1], you could also set up serial logging. Though using serial ports have become a bit of a lost art, the serial console code in the kernel is pretty carefully designed to be the last man standing when things start to die. It's possible (though I wouldn't say probable) that a serial console will be able to show you stuff closer to the event horizon than a network console can. Anyway, since still I'm in the serial port business (yes, there are still plenty of people using serial ports in industrial settings) I had to mention it... [1] For this purpose you want a plain old UART on the motherboard type seial port. You'd be surprised how many motherboards still have them. Even though they're never brought out to a DB9 connector on the back panel, there's often an 8-pin header on the edge of the board somewhere, so you'd need one of these: https://www.amazon.com/C2G-27550-Adapter-Bracket-Motherboards/dp/B0002J27R8/ -- Grant ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-15 14:50 ` [gentoo-user] " Grant Edwards @ 2023-02-15 15:19 ` John Covici 2023-02-16 17:37 ` Laurence Perkins 2023-02-15 16:42 ` Grant Edwards 2023-02-16 11:50 ` John Covici 2 siblings, 1 reply; 19+ messages in thread From: John Covici @ 2023-02-15 15:19 UTC (permalink / raw To: gentoo-user On Wed, 15 Feb 2023 09:50:27 -0500, Grant Edwards wrote: > > On 2023-02-14, Rich Freeman <rich0@gentoo.org> wrote: > > > Where are you getting this from, the system log/journal? This doesn't > > seem like a clean shutdown, so if it is a kernel PANIC I wouldn't > > expect the most critical info to be in the log (since it will stop > > syncing to protect the filesystem). The details you need probably > > will be displayed on the console briefly. You can also enable a > > network console, which will send the dmesg output continuously over > > UDP to another device. This won't be interrupted by a PANIC unless > > there is some issue with the hardware or networking stack. > > If you've got a serial port[1], you could also set up serial > logging. Though using serial ports have become a bit of a lost art, > the serial console code in the kernel is pretty carefully designed to > be the last man standing when things start to die. It's possible > (though I wouldn't say probable) that a serial console will be able to > show you stuff closer to the event horizon than a network console can. > > Anyway, since still I'm in the serial port business (yes, there are > still plenty of people using serial ports in industrial settings) I > had to mention it... > > [1] For this purpose you want a plain old UART on the motherboard type > seial port. You'd be surprised how many motherboards still have > them. Even though they're never brought out to a DB9 connector on > the back panel, there's often an 8-pin header on the edge of the > board somewhere, so you'd need one of these: > > https://www.amazon.com/C2G-27550-Adapter-Bracket-Motherboards/dp/B0002J27R8/ I do have one which I use for my speech synthesizer. I also have one on my other box which I could hook up -- if I can find my null modem cable. I think I will try the netconsole first and the serial console if that does not work. Thanks for the hint. -- Your life is like a penny. You're going to lose it. The question is: How do you spend it? John Covici wb2una covici@ccs.covici.com ^ permalink raw reply [flat|nested] 19+ messages in thread
* RE: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-15 15:19 ` John Covici @ 2023-02-16 17:37 ` Laurence Perkins 2023-02-17 19:03 ` John Covici ` (2 more replies) 0 siblings, 3 replies; 19+ messages in thread From: Laurence Perkins @ 2023-02-16 17:37 UTC (permalink / raw To: gentoo-user@lists.gentoo.org >-----Original Message----- >From: John Covici <covici@ccs.covici.com> >Sent: Wednesday, February 15, 2023 7:20 AM >To: gentoo-user@lists.gentoo.org >Subject: Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting > >On Wed, 15 Feb 2023 09:50:27 -0500, >Grant Edwards wrote: >> >> On 2023-02-14, Rich Freeman <rich0@gentoo.org> wrote: >> >> > Where are you getting this from, the system log/journal? This >> > doesn't seem like a clean shutdown, so if it is a kernel PANIC I >> > wouldn't expect the most critical info to be in the log (since it >> > will stop syncing to protect the filesystem). The details you need >> > probably will be displayed on the console briefly. You can also >> > enable a network console, which will send the dmesg output >> > continuously over UDP to another device. This won't be interrupted >> > by a PANIC unless there is some issue with the hardware or networking stack. >> >> If you've got a serial port[1], you could also set up serial logging. >> Though using serial ports have become a bit of a lost art, the serial >> console code in the kernel is pretty carefully designed to be the last >> man standing when things start to die. It's possible (though I >> wouldn't say probable) that a serial console will be able to show you >> stuff closer to the event horizon than a network console can. >> >> Anyway, since still I'm in the serial port business (yes, there are >> still plenty of people using serial ports in industrial settings) I >> had to mention it... >> >> [1] For this purpose you want a plain old UART on the motherboard type >> seial port. You'd be surprised how many motherboards still have >> them. Even though they're never brought out to a DB9 connector on >> the back panel, there's often an 8-pin header on the edge of the >> board somewhere, so you'd need one of these: >> >> >> https://www.amazon.com/C2G-27550-Adapter-Bracket-Motherboards/dp/B0002 >> J27R8/ > >I do have one which I use for my speech synthesizer. I also have one on my other box which I could hook up -- if I can find my null modem cable. I think I will try the netconsole first and the serial console if that does not work. > >Thanks for the hint. > > https://wiki.gentoo.org/wiki/Kernel_Crash_Dumps is another option if you're somehow not getting enough information out of the console. More complex to set up, but you can take an actual debugger to the result and hopefully find out exactly what's going on. LMP ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-16 17:37 ` Laurence Perkins @ 2023-02-17 19:03 ` John Covici 2023-02-17 20:13 ` Mark Knecht 2023-04-16 10:09 ` John Covici 2023-04-16 10:28 ` John Covici 2 siblings, 1 reply; 19+ messages in thread From: John Covici @ 2023-02-17 19:03 UTC (permalink / raw To: gentoo-user On Thu, 16 Feb 2023 12:37:51 -0500, Laurence Perkins wrote: > > > > >-----Original Message----- > >From: John Covici <covici@ccs.covici.com> > >Sent: Wednesday, February 15, 2023 7:20 AM > >To: gentoo-user@lists.gentoo.org > >Subject: Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting > > > >On Wed, 15 Feb 2023 09:50:27 -0500, > >Grant Edwards wrote: > >> > >> On 2023-02-14, Rich Freeman <rich0@gentoo.org> wrote: > >> > >> > Where are you getting this from, the system log/journal? This > >> > doesn't seem like a clean shutdown, so if it is a kernel PANIC I > >> > wouldn't expect the most critical info to be in the log (since it > >> > will stop syncing to protect the filesystem). The details you need > >> > probably will be displayed on the console briefly. You can also > >> > enable a network console, which will send the dmesg output > >> > continuously over UDP to another device. This won't be interrupted > >> > by a PANIC unless there is some issue with the hardware or networking stack. > >> > >> If you've got a serial port[1], you could also set up serial logging. > >> Though using serial ports have become a bit of a lost art, the serial > >> console code in the kernel is pretty carefully designed to be the last > >> man standing when things start to die. It's possible (though I > >> wouldn't say probable) that a serial console will be able to show you > >> stuff closer to the event horizon than a network console can. > >> > >> Anyway, since still I'm in the serial port business (yes, there are > >> still plenty of people using serial ports in industrial settings) I > >> had to mention it... > >> > >> [1] For this purpose you want a plain old UART on the motherboard type > >> seial port. You'd be surprised how many motherboards still have > >> them. Even though they're never brought out to a DB9 connector on > >> the back panel, there's often an 8-pin header on the edge of the > >> board somewhere, so you'd need one of these: > >> > >> > >> https://www.amazon.com/C2G-27550-Adapter-Bracket-Motherboards/dp/B0002 > >> J27R8/ > > > >I do have one which I use for my speech synthesizer. I also have one on my other box which I could hook up -- if I can find my null modem cable. I think I will try the netconsole first and the serial console if that does not work. > > > >Thanks for the hint. > > > > > https://wiki.gentoo.org/wiki/Kernel_Crash_Dumps is another option if you're somehow not getting enough information out of the console. More complex to set up, but you can take an actual debugger to the result and hopefully find out exactly what's going on. Well, some progress, but no joy. I found actual messages from netconsole and it seems no matter what device I put for the source, netconsole says it doesn't exist. I tried my eno1, and also eth0 and eth1. In my normal boot sequence, I see that udev renamed eth1 to eno1, but netconsole still said it does not exist. So, I may have to use the serial console method, I have to find my cables for that. I did also try to add net.ifnames=0 to my boot options, but no joy there. -- Your life is like a penny. You're going to lose it. The question is: How do you spend it? John Covici wb2una covici@ccs.covici.com ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-17 19:03 ` John Covici @ 2023-02-17 20:13 ` Mark Knecht 2023-02-17 21:30 ` John Covici 0 siblings, 1 reply; 19+ messages in thread From: Mark Knecht @ 2023-02-17 20:13 UTC (permalink / raw To: gentoo-user [-- Attachment #1: Type: text/plain, Size: 1508 bytes --] On Fri, Feb 17, 2023 at 12:03 PM John Covici <covici@ccs.covici.com> wrote: <SNIP> > Well, some progress, but no joy. I found actual messages from > netconsole and it seems no matter what device I put for the source, > netconsole says it doesn't exist. I tried my eno1, and also eth0 and > eth1. In my normal boot sequence, I see that udev renamed eth1 to > eno1, but netconsole still said it does not exist. So, I may have to > use the serial console method, I have to find my cables for that. I > did also try to add net.ifnames=0 to my boot options, but no joy > there. > > -- > Your life is like a penny. You're going to lose it. The question is: > How do > you spend it? > > John Covici wb2una > covici@ccs.covici.com John, I did a bad job at trying to point you in this direction the other day, and in my testing I'm not sure how well it works. However another option you might investigate is on the receiving end you can apparently set the transmitter's IP address by using the transmitter's mac address. Supposedly you would execute something like the following, with extra spaces added for readability: sudo arp -s 192.168.86.244 90:e6:ba:10:a3:e7 temp which supposedly says 'when you see a packet with this mac address associate it with this IP address'. The temp part says don't add it to the permanent tables. After executing this you are supposed to be able to use tools that filter by IP address but I didn't have great results. Hope this helps, Mark [-- Attachment #2: Type: text/html, Size: 1948 bytes --] ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-17 20:13 ` Mark Knecht @ 2023-02-17 21:30 ` John Covici 0 siblings, 0 replies; 19+ messages in thread From: John Covici @ 2023-02-17 21:30 UTC (permalink / raw To: gentoo-user My problem is that the sender aborts netconsole, so there is nothing to receive. On Fri, 17 Feb 2023 15:13:52 -0500, Mark Knecht wrote: > > [1 <text/plain; UTF-8 (7bit)>] > On Fri, Feb 17, 2023 at 12:03 PM John Covici <covici@ccs.covici.com> wrote: > <SNIP> > > Well, some progress, but no joy. I found actual messages from > > netconsole and it seems no matter what device I put for the source, > > netconsole says it doesn't exist. I tried my eno1, and also eth0 and > > eth1. In my normal boot sequence, I see that udev renamed eth1 to > > eno1, but netconsole still said it does not exist. So, I may have to > > use the serial console method, I have to find my cables for that. I > > did also try to add net.ifnames=0 to my boot options, but no joy > > there. > > > > -- > > Your life is like a penny. You're going to lose it. The question is: > > How do > > you spend it? > > > > John Covici wb2una > > covici@ccs.covici.com > > John, > I did a bad job at trying to point you in this direction the other day, > and in my testing I'm not sure how well it works. However another > option you might investigate is on the receiving end you can > apparently set the transmitter's IP address by using the > transmitter's mac address. Supposedly you would execute > something like the following, with extra spaces added > for readability: > > sudo arp -s 192.168.86.244 90:e6:ba:10:a3:e7 temp > > which supposedly says 'when you see a packet with this > mac address associate it with this IP address'. The temp > part says don't add it to the permanent tables. > > After executing this you are supposed to be able to use tools > that filter by IP address but I didn't have great results. > > Hope this helps, > Mark > [2 <text/html; UTF-8 (quoted-printable)>] -- Your life is like a penny. You're going to lose it. The question is: How do you spend it? John Covici wb2una covici@ccs.covici.com ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-16 17:37 ` Laurence Perkins 2023-02-17 19:03 ` John Covici @ 2023-04-16 10:09 ` John Covici 2023-04-16 10:28 ` John Covici 2 siblings, 0 replies; 19+ messages in thread From: John Covici @ 2023-04-16 10:09 UTC (permalink / raw To: gentoo-user On Thu, 16 Feb 2023 12:37:51 -0500, Laurence Perkins wrote: > > > > >-----Original Message----- > >From: John Covici <covici@ccs.covici.com> > >Sent: Wednesday, February 15, 2023 7:20 AM > >To: gentoo-user@lists.gentoo.org > >Subject: Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting > > > >On Wed, 15 Feb 2023 09:50:27 -0500, > >Grant Edwards wrote: > >> > >> On 2023-02-14, Rich Freeman <rich0@gentoo.org> wrote: > >> > >> > Where are you getting this from, the system log/journal? This > >> > doesn't seem like a clean shutdown, so if it is a kernel PANIC I > >> > wouldn't expect the most critical info to be in the log (since it > >> > will stop syncing to protect the filesystem). The details you need > >> > probably will be displayed on the console briefly. You can also > >> > enable a network console, which will send the dmesg output > >> > continuously over UDP to another device. This won't be interrupted > >> > by a PANIC unless there is some issue with the hardware or networking stack. > >> > >> If you've got a serial port[1], you could also set up serial logging. > >> Though using serial ports have become a bit of a lost art, the serial > >> console code in the kernel is pretty carefully designed to be the last > >> man standing when things start to die. It's possible (though I > >> wouldn't say probable) that a serial console will be able to show you > >> stuff closer to the event horizon than a network console can. > >> > >> Anyway, since still I'm in the serial port business (yes, there are > >> still plenty of people using serial ports in industrial settings) I > >> had to mention it... > >> > >> [1] For this purpose you want a plain old UART on the motherboard type > >> seial port. You'd be surprised how many motherboards still have > >> them. Even though they're never brought out to a DB9 connector on > >> the back panel, there's often an 8-pin header on the edge of the > >> board somewhere, so you'd need one of these: > >> > >> > >> https://www.amazon.com/C2G-27550-Adapter-Bracket-Motherboards/dp/B0002 > >> J27R8/ > > > >I do have one which I use for my speech synthesizer. I also have one on my other box which I could hook up -- if I can find my null modem cable. I think I will try the netconsole first and the serial console if that does not work. > > > >Thanks for the hint. > > > > > https://wiki.gentoo.org/wiki/Kernel_Crash_Dumps is another option if you're somehow not getting enough information out of the console. More complex to set up, but you can take an actual debugger to the result and hopefully find out exactly what's going on. So, since I could get nothing out of the net console (it kept saying that the device was not found) and my null modem connection between the computer and another box only seemed to work from the box to the computer with the problem but not the other way, I am trying to set up to get a crash dump. A few questions about this -- my root partition is zfs, whereas the article seems to use /dev/something for the root. I am using systemd, so what do I need in /etc/kexec.conf --do I put all my kernel boot parameters in that file? -- Your life is like a penny. You're going to lose it. The question is: How do you spend it? John Covici wb2una covici@ccs.covici.com ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-16 17:37 ` Laurence Perkins 2023-02-17 19:03 ` John Covici 2023-04-16 10:09 ` John Covici @ 2023-04-16 10:28 ` John Covici 2 siblings, 0 replies; 19+ messages in thread From: John Covici @ 2023-04-16 10:28 UTC (permalink / raw To: gentoo-user On Thu, 16 Feb 2023 12:37:51 -0500, Laurence Perkins wrote: > > > > >-----Original Message----- > >From: John Covici <covici@ccs.covici.com> > >Sent: Wednesday, February 15, 2023 7:20 AM > >To: gentoo-user@lists.gentoo.org > >Subject: Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting > > > >On Wed, 15 Feb 2023 09:50:27 -0500, > >Grant Edwards wrote: > >> > >> On 2023-02-14, Rich Freeman <rich0@gentoo.org> wrote: > >> > >> > Where are you getting this from, the system log/journal? This > >> > doesn't seem like a clean shutdown, so if it is a kernel PANIC I > >> > wouldn't expect the most critical info to be in the log (since it > >> > will stop syncing to protect the filesystem). The details you need > >> > probably will be displayed on the console briefly. You can also > >> > enable a network console, which will send the dmesg output > >> > continuously over UDP to another device. This won't be interrupted > >> > by a PANIC unless there is some issue with the hardware or networking stack. > >> > >> If you've got a serial port[1], you could also set up serial logging. > >> Though using serial ports have become a bit of a lost art, the serial > >> console code in the kernel is pretty carefully designed to be the last > >> man standing when things start to die. It's possible (though I > >> wouldn't say probable) that a serial console will be able to show you > >> stuff closer to the event horizon than a network console can. > >> > >> Anyway, since still I'm in the serial port business (yes, there are > >> still plenty of people using serial ports in industrial settings) I > >> had to mention it... > >> > >> [1] For this purpose you want a plain old UART on the motherboard type > >> seial port. You'd be surprised how many motherboards still have > >> them. Even though they're never brought out to a DB9 connector on > >> the back panel, there's often an 8-pin header on the edge of the > >> board somewhere, so you'd need one of these: > >> > >> > >> https://www.amazon.com/C2G-27550-Adapter-Bracket-Motherboards/dp/B0002 > >> J27R8/ > > > >I do have one which I use for my speech synthesizer. I also have one on my other box which I could hook up -- if I can find my null modem cable. I think I will try the netconsole first and the serial console if that does not work. > > > >Thanks for the hint. > > > > > https://wiki.gentoo.org/wiki/Kernel_Crash_Dumps is another option if you're somehow not getting enough information out of the console. More complex to set up, but you can take an actual debugger to the result and hopefully find out exactly what's going on. Second try, don't know what happened. So, after not getting any results from net console and somehow my null modem cable only seemed to work from another computer to the one with the problem kernel, I am trying to figure out how to set up for getting a crash dump. When looking at the article, it seems to want a root partition -- I use zfs, which automatically detects the root partition, so can I just forget about that one? Also I am using systemd, so there is no /etc/local.d, but I do have another location where I put commands to run after everything else has run -- do I put the start up script there? Also, there is a file /etc/conf.d/kexec.conf and I got a notice to move it to /etc/kexec.conf, what do I put there? -- Your life is like a penny. You're going to lose it. The question is: How do you spend it? John Covici wb2una covici@ccs.covici.com ^ permalink raw reply [flat|nested] 19+ messages in thread
* [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-15 14:50 ` [gentoo-user] " Grant Edwards 2023-02-15 15:19 ` John Covici @ 2023-02-15 16:42 ` Grant Edwards 2023-02-16 11:50 ` John Covici 2 siblings, 0 replies; 19+ messages in thread From: Grant Edwards @ 2023-02-15 16:42 UTC (permalink / raw To: gentoo-user On 2023-02-15, Grant Edwards <grant.b.edwards@gmail.com> wrote: > [1] For this purpose you want a plain old UART on the motherboard type > seial port. You'd be surprised how many motherboards still have > them. Even though they're never brought out to a DB9 connector on > the back panel, there's often an 8-pin header on the edge of the > board somewhere, so you'd need one of these: Oops, it's a 10pin (2x5) header not an 8-pin header, as I'm sure you'd have figured out. ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-15 14:50 ` [gentoo-user] " Grant Edwards 2023-02-15 15:19 ` John Covici 2023-02-15 16:42 ` Grant Edwards @ 2023-02-16 11:50 ` John Covici 2023-02-16 12:11 ` Rich Freeman 2 siblings, 1 reply; 19+ messages in thread From: John Covici @ 2023-02-16 11:50 UTC (permalink / raw To: gentoo-user On Wed, 15 Feb 2023 09:50:27 -0500, Grant Edwards wrote: > > On 2023-02-14, Rich Freeman <rich0@gentoo.org> wrote: > > > Where are you getting this from, the system log/journal? This doesn't > > seem like a clean shutdown, so if it is a kernel PANIC I wouldn't > > expect the most critical info to be in the log (since it will stop > > syncing to protect the filesystem). The details you need probably > > will be displayed on the console briefly. You can also enable a > > network console, which will send the dmesg output continuously over > > UDP to another device. This won't be interrupted by a PANIC unless > > there is some issue with the hardware or networking stack. > > If you've got a serial port[1], you could also set up serial > logging. Though using serial ports have become a bit of a lost art, > the serial console code in the kernel is pretty carefully designed to > be the last man standing when things start to die. It's possible > (though I wouldn't say probable) that a serial console will be able to > show you stuff closer to the event horizon than a network console can. > > Anyway, since still I'm in the serial port business (yes, there are > still plenty of people using serial ports in industrial settings) I > had to mention it... > > [1] For this purpose you want a plain old UART on the motherboard type > seial port. You'd be surprised how many motherboards still have > them. Even though they're never brought out to a DB9 connector on > the back panel, there's often an 8-pin header on the edge of the > board somewhere, so you'd need one of these: > > https://www.amazon.com/C2G-27550-Adapter-Bracket-Motherboards/dp/B0002J27R8/ > > Still having problems with the netconsole -- I am determined to get this working,so let me explain a bit more. The sending computer has two nics, eno1 for the internal network and eno2 is on the internet. So, my netconsole stanza said netconsole=@192.168.0.1/eno1,@192.168.0.2 The box which is at 192.168.0.2 has netcat (windows version) and I tried the following: netcat -u -v -l 192.168.0.2 6666 and I also tried 192.168.0.1 6666 which is the ip address of the linux console which I am trying to debug. I also tried 0.0.0.0 6666 which did not work either, but I think the windows firewall was blocking, and I did fix that, but did not try the 0.0.0.0 after that. So, what am I doing wrong here? -- Your life is like a penny. You're going to lose it. The question is: How do you spend it? John Covici wb2una covici@ccs.covici.com ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-16 11:50 ` John Covici @ 2023-02-16 12:11 ` Rich Freeman 2023-02-16 14:08 ` John Covici 0 siblings, 1 reply; 19+ messages in thread From: Rich Freeman @ 2023-02-16 12:11 UTC (permalink / raw To: gentoo-user On Thu, Feb 16, 2023 at 6:50 AM John Covici <covici@ccs.covici.com> wrote: > > The sending computer has two nics, eno1 for the internal network and > eno2 is on the internet. So, my netconsole stanza said > netconsole=@192.168.0.1/eno1,@192.168.0.2 Is CONFIG_NETCONSOLE enabled for your kernel? I'm not sure if the kernel will assign the names eno1/2 to interfaces - I think those might be assigned by udev, which probably won't have run before the kernel parses this instruction. You might need to use eth0/1 - and your guess is as good as mine which one corresponds to which. If it isn't one of those it might not hurt to put the target mac address in there just to be safe. I haven't needed that but maybe there are situations where ARP won't work (it would be needed if you are crossing subnets, in which case you'd need the gateway MAC). Keep in mind that this is a low-level function that doesn't use any routing/userspace/etc. It was designed to be robust in the event of a PANIC and to be able to be enabled fairly early during boot, so it can't rely on the sorts of things we just take for granted with networking. > > The box which is at 192.168.0.2 has netcat (windows version) and I > tried the following: > netcat -u -v -l 192.168.0.2 6666 and I also tried 192.168.0.1 6666 > which is the ip address of the linux console which I am trying to > debug. > > I also tried 0.0.0.0 6666 which did not work either, but I think the > windows firewall was blocking, and I did fix that, but did not try the > 0.0.0.0 after that. > So I'm pretty sure that netcat requires listing the destination IP, since it has to open a socket to listen on that IP. You can optionally set a source address/port in which case it will ignore anything else, but by default it will accept packets from any source. I was definitely going to suggest making sure that a windows firewall wasn't blocking the inbound connections. That's fairly default behavior on windows. -- Rich ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-16 12:11 ` Rich Freeman @ 2023-02-16 14:08 ` John Covici 2023-02-16 14:17 ` Mark Knecht 2023-02-16 14:33 ` Rich Freeman 0 siblings, 2 replies; 19+ messages in thread From: John Covici @ 2023-02-16 14:08 UTC (permalink / raw To: gentoo-user On Thu, 16 Feb 2023 07:11:12 -0500, Rich Freeman wrote: > > On Thu, Feb 16, 2023 at 6:50 AM John Covici <covici@ccs.covici.com> wrote: > > > > The sending computer has two nics, eno1 for the internal network and > > eno2 is on the internet. So, my netconsole stanza said > > netconsole=@192.168.0.1/eno1,@192.168.0.2 > > Is CONFIG_NETCONSOLE enabled for your kernel? > > I'm not sure if the kernel will assign the names eno1/2 to interfaces > - I think those might be assigned by udev, which probably won't have > run before the kernel parses this instruction. You might need to use > eth0/1 - and your guess is as good as mine which one corresponds to > which. > > If it isn't one of those it might not hurt to put the target mac > address in there just to be safe. I haven't needed that but maybe > there are situations where ARP won't work (it would be needed if you > are crossing subnets, in which case you'd need the gateway MAC). Keep > in mind that this is a low-level function that doesn't use any > routing/userspace/etc. It was designed to be robust in the event of a > PANIC and to be able to be enabled fairly early during boot, so it > can't rely on the sorts of things we just take for granted with > networking. > > > > > The box which is at 192.168.0.2 has netcat (windows version) and I > > tried the following: > > netcat -u -v -l 192.168.0.2 6666 and I also tried 192.168.0.1 6666 > > which is the ip address of the linux console which I am trying to > > debug. > > > > I also tried 0.0.0.0 6666 which did not work either, but I think the > > windows firewall was blocking, and I did fix that, but did not try the > > 0.0.0.0 after that. > > > > So I'm pretty sure that netcat requires listing the destination IP, > since it has to open a socket to listen on that IP. You can > optionally set a source address/port in which case it will ignore > anything else, but by default it will accept packets from any source. > > I was definitely going to suggest making sure that a windows firewall > wasn't blocking the inbound connections. That's fairly default > behavior on windows. hmmm, but what should I use for the source ip, I only assign those when I bring the interface up when I start the interface -- I have something like this: [Unit] Description=Network Connectivity for %i Documentation=man:ip Before=network.target Wants=network.target BindsTo=sys-subsystem-net-devices-%i.device After=sys-subsystem-net-devices-%i.device [Service] Type=oneshot RemainAfterExit=yes EnvironmentFile=/etc/conf.d/network@%i ExecStart=/bin/ip link set dev %i up ExecStart=/bin/ip addr add ${address}/${netmask} broadcast ${broadcast} dev %i ExecStart=-/bin/bash -c "test -n ${gateway} && /bin/ip route add default via ${gateway}" ExecStart=-/bin/bash -c "test -f /etc/conf.d/postup@%i.sh&&/bin/bash -c /etc/conf.d/postup@%i.sh" ExecStop=/bin/ip addr flush dev %i ExecStop=/bin/ip link set dev %i down ExecStop=-/bin/bash -c "test -f /etc/conf.d/postdown@%i.sh&&/bin/bash -c /etc/conf.d/postdown@%i.sh" [Install] WantedBy=multi-user.target and the /etc/conf.d/network@eno1 is address=192.168.0.1 netmask=24 broadcast=192.168.0.255 So, before I run this, I don't think the card has any ip address, does it? -- Your life is like a penny. You're going to lose it. The question is: How do you spend it? John Covici wb2una covici@ccs.covici.com ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-16 14:08 ` John Covici @ 2023-02-16 14:17 ` Mark Knecht 2023-02-16 14:33 ` Rich Freeman 1 sibling, 0 replies; 19+ messages in thread From: Mark Knecht @ 2023-02-16 14:17 UTC (permalink / raw To: gentoo-user [-- Attachment #1: Type: text/plain, Size: 194 bytes --] address=192.168.0.1 netmask=24 broadcast=192.168.0.255 So, before I run this, I don't think the card has any ip address, does it? From what you hope the receiving machine arp -a ? HTH, Mark [-- Attachment #2: Type: text/html, Size: 332 bytes --] ^ permalink raw reply [flat|nested] 19+ messages in thread
* Re: [gentoo-user] Re: my 5.15.93 kernel keeps rebooting 2023-02-16 14:08 ` John Covici 2023-02-16 14:17 ` Mark Knecht @ 2023-02-16 14:33 ` Rich Freeman 1 sibling, 0 replies; 19+ messages in thread From: Rich Freeman @ 2023-02-16 14:33 UTC (permalink / raw To: gentoo-user On Thu, Feb 16, 2023 at 9:08 AM John Covici <covici@ccs.covici.com> wrote: > > hmmm, but what should I use for the source ip, I only assign those > when I bring the interface up when I start the interface -- I have > something like this: > [Unit] > Description=Network Connectivity for %i > ... > So, before I run this, I don't think the card has any ip address, does > it? So, "cards" don't have an IP address. The kernel assigns an IP address to an interface, which is entirely a software construct. It happens to be a software construct that the network console feature largely ignores anyway. I didn't go reading the source code, but I'm guessing it is just constructing raw UDP packets and it will happily set the IP to whatever you want it to be. After all, it is just a field on the packet. So you can make the source IP whatever you want it to be. Just expect the packets to show up with the IP you set on them. There is no connection, so the IP doesn't need to be reachable by anything else. You could stick literally anything in there as long as some firewall isn't going to object and drop the packet. The destination IP matters because that is where it is going to go, and the interface matters because if it gets sent out on the wrong interface then obviously it won't make it there. I have no idea if the netconsole packets get seen by netfilter, but if this is on some kind of router that might be something you need to check, because if netfilter is configured to drop unassociated UDP from the firewall to the LAN that could be an issue. However, it is possible this just bypasses netfilter entirely. If you have the dynamic netconsole option enabled you could have a script update the settings after your network is up to set the source IP to the one assigned by DHCP and make sure it is on the right interface. As you point out though at boot time the interface won't have an IP. It won't even be "up," not that this is likely to bother the kernel. -- Rich ^ permalink raw reply [flat|nested] 19+ messages in thread
end of thread, other threads:[~2023-04-16 10:28 UTC | newest] Thread overview: 19+ messages (download: mbox.gz follow: Atom feed -- links below jump to the message on this page -- 2023-02-14 14:08 [gentoo-user] my 5.15.93 kernel keeps rebooting John Covici 2023-02-14 19:08 ` Rich Freeman 2023-02-14 19:54 ` John Covici 2023-02-14 21:25 ` Rich Freeman 2023-02-14 22:04 ` John Covici 2023-02-15 14:50 ` [gentoo-user] " Grant Edwards 2023-02-15 15:19 ` John Covici 2023-02-16 17:37 ` Laurence Perkins 2023-02-17 19:03 ` John Covici 2023-02-17 20:13 ` Mark Knecht 2023-02-17 21:30 ` John Covici 2023-04-16 10:09 ` John Covici 2023-04-16 10:28 ` John Covici 2023-02-15 16:42 ` Grant Edwards 2023-02-16 11:50 ` John Covici 2023-02-16 12:11 ` Rich Freeman 2023-02-16 14:08 ` John Covici 2023-02-16 14:17 ` Mark Knecht 2023-02-16 14:33 ` Rich Freeman
This is a public inbox, see mirroring instructions for how to clone and mirror all data and code used for this inbox