<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0" xmlns:atom="http://www.w3.org/2005/Atom">
	<channel>
		<atom:link href="http://dev1galaxy.org/extern.php?action=feed&amp;tid=3813&amp;type=rss" rel="self" type="application/rss+xml" />
		<title><![CDATA[Dev1 Galaxy Forum / Beowulf crashes often]]></title>
		<link>http://dev1galaxy.org/viewtopic.php?id=3813</link>
		<description><![CDATA[The most recent posts in Beowulf crashes often.]]></description>
		<lastBuildDate>Tue, 15 Sep 2020 07:23:25 +0000</lastBuildDate>
		<generator>FluxBB</generator>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24659#p24659</link>
			<description><![CDATA[<div class="quotebox"><cite>GlennW wrote:</cite><blockquote><div><div class="quotebox"><blockquote><div><p>zapper...<br />Does your laptop function without nvidia? If so, I recommend it. Proprietary blobs have unknown consequences. Security/privacy and who knows what else.</p></div></blockquote></div><p>This post is about my desktop system; I play OTTD and Counter-Strike: Source. The graphics driver helps with CS:S, but it&#039;s so old the other drivers work fine. I find it easier to manage kernel and nVidia updates manually.</p><p>The desktop system is my entertainment system: constant music (streaming 4zzz radio, and my enormous CD collection), videos, games and web browsing.</p><p>My laptop runs the same OS, but I don&#039;t bother with the graphics modules as I only use it in emergencies and occasional couch surfing.<br />It has an Intel onboard and an nVidia card; I think it&#039;s able to switch on loading... but that may be a m$Win thing, it had win7 and 10 on it at first.</p><p>Wireless was the main reason I went to backports, to tether off my iPhone :-) (Thanks to this site I got a &quot;reliable&quot; connection happening)</p><p>Anyhow, I haven&#039;t seen any security probs with the nVidia .run installs, it seems simpler to me (than dkms)... But I keep my eyes open and never stop learning about our operating systems and desktops.</p><p>Thanks for everything. Your mileage, of course, may vary.</p><p>Best regards, Glenn</p></div></blockquote></div><p>Ah, okay... Well, if it works for ya, good. I don&#039;t trust blobs myself if I think they are related to network or have a remote weakness. But if it&#039;s fine for you, by all means. That&#039;s just my view. heh...</p>]]></description>
			<author><![CDATA[dummy@example.com (zapper)]]></author>
			<pubDate>Tue, 15 Sep 2020 07:23:25 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24659#p24659</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24625#p24625</link>
			<description><![CDATA[<p>I run the proprietary NVIDIA driver on Beowulf without any issues: no lock-ups on either of the two computers I run this OS on (NVIDIA 970 and 760, respectively). I use the NVIDIA packages from the repo.</p><p>One thing though: if the «backports» repo is enabled when installing the NVIDIA driver packages, one might run into package conflicts; at least that was my experience a couple of times in ASCII. If this is the case, one might try removing all NVIDIA packages, disabling «backports» and trying again.<br />Just a suggestion.</p><p>Cheers,<br />Olav</p>]]></description>
			<author><![CDATA[dummy@example.com (F_Sauce)]]></author>
			<pubDate>Mon, 14 Sep 2020 11:45:30 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24625#p24625</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24620#p24620</link>
			<description><![CDATA[<div class="quotebox"><blockquote><div><p>zapper...<br />Does your laptop function without nvidia? If so, I recommend it. Proprietary blobs have unknown consequences. Security/privacy and who knows what else.</p></div></blockquote></div><p>This post is about my desktop system; I play OTTD and Counter-Strike: Source. The graphics driver helps with CS:S, but it&#039;s so old the other drivers work fine. I find it easier to manage kernel and nVidia updates manually.</p><p>The desktop system is my entertainment system: constant music (streaming 4zzz radio, and my enormous CD collection), videos, games and web browsing.</p><p>My laptop runs the same OS, but I don&#039;t bother with the graphics modules as I only use it in emergencies and occasional couch surfing.<br />It has an Intel onboard and an nVidia card; I think it&#039;s able to switch on loading... but that may be a m$Win thing, it had win7 and 10 on it at first.</p><p>Wireless was the main reason I went to backports, to tether off my iPhone :-) (Thanks to this site I got a &quot;reliable&quot; connection happening)</p><p>Anyhow, I haven&#039;t seen any security probs with the nVidia .run installs, it seems simpler to me (than dkms)... But I keep my eyes open and never stop learning about our operating systems and desktops.</p><p>Thanks for everything. Your mileage, of course, may vary.</p><p>Best regards, Glenn</p>]]></description>
			<author><![CDATA[dummy@example.com (GlennW)]]></author>
			<pubDate>Sun, 13 Sep 2020 23:09:42 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24620#p24620</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24588#p24588</link>
			<description><![CDATA[<div class="quotebox"><cite>erdos wrote:</cite><blockquote><div><p>it seems that Beowulf on my computer is crashing often</p></div></blockquote></div><p>Have you installed the CPU µcode package?</p><p><a href="https://pkginfo.devuan.org/stage/beowulf/beowulf/amd64-microcode_3.20181128.1.html" rel="nofollow">https://pkginfo.devuan.org/stage/beowul … 128.1.html</a></p><p><a href="https://pkginfo.devuan.org/stage/beowulf/beowulf-security/intel-microcode_3.20200609.2~deb10u1.html" rel="nofollow">https://pkginfo.devuan.org/stage/beowul … b10u1.html</a></p>]]></description>
			<author><![CDATA[dummy@example.com (Head_on_a_Stick)]]></author>
			<pubDate>Sat, 12 Sep 2020 17:32:17 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24588#p24588</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24581#p24581</link>
			<description><![CDATA[<div class="quotebox"><cite>GlennW wrote:</cite><blockquote><div><p>I&#039;m using backports kernel, and the newest nVidia drivers in the .run package from nVidia&#039;s website.</p><div class="codebox"><pre><code>glenn@GamesBox ~ $ inxi -F
System:    Host: GamesBox Kernel: 5.7.0-0.bpo.2-amd64 x86_64 bits: 64 Desktop: KDE Plasma 5.14.5 
           Distro: Devuan GNU/Linux 3 (beowulf) 
Machine:   Type: Desktop Mobo: ASUSTeK model: ROG STRIX X470-F GAMING v: Rev X.0x serial: &lt;root required&gt; 
           UEFI [Legacy]: American Megatrends v: 5406 date: 11/13/2019 
CPU:       Topology: 8-Core model: AMD Ryzen 7 2700X bits: 64 type: MT MCP L2 cache: 4096 KiB 
           Speed: 1913 MHz min/max: 2200/3700 MHz Core speeds (MHz): 1: 1990 2: 1979 3: 2001 4: 2055 5: 1895 6: 1937 7: 2193 
           8: 2078 9: 1964 10: 1912 11: 2032 12: 1912 13: 2190 14: 2139 15: 2045 16: 2188 
Graphics:  Device-1: NVIDIA GP106 [GeForce GTX 1060 6GB] driver: nvidia v: 450.57 
           Display: x11 server: X.Org 1.20.4 driver: nvidia resolution: 1920x1080~60Hz 
           OpenGL: renderer: GeForce GTX 1060 6GB/PCIe/SSE2 v: 4.6.0 NVIDIA 450.57 
Audio:     Device-1: NVIDIA GP106 High Definition Audio driver: snd_hda_intel 
           Device-2: Roland EDIROL UA-25EX type: USB driver: snd-usb-audio 
           Sound Server: ALSA v: k5.7.0-0.bpo.2-amd64 
Network:   Device-1: Intel Wireless-AC 9260 driver: iwlwifi </code></pre></div><p>I get occasional lockups in my web-browser Palemoon when using facebook. <br />I suspect facebook is sucking the life out of me and my computer, but I digress.</p></div></blockquote></div><p>I have only installed proprietary drivers twice. One was on an HP laptop: Debian or Devuan, it didn&#039;t matter, it was so buggy that even a USB wifi adapter doesn&#039;t help. Thus that particular laptop is stuck with ethernet only. Why bother with a wifi blob if it only works for the first 15 mins after booting, after all... <img src="http://dev1galaxy.org/img/smilies/smile.png" width="15" height="15" alt="smile" /></p><p>The other time I used a blob was on an old desktop PC, for booting it up. And there were no issues.</p><p>Does your laptop function without nvidia? If so, I recommend it. Proprietary blobs have unknown consequences. Security/privacy and who knows what else.</p><p>That&#039;s just me though; I am sure you are aware of this.</p><p>I just wondered why anyone would take that risk. Then again, I use a libreboot ThinkPad X200 (Hyperbola) daily for certain things.</p><p>For gaming, such as Wine, I use a ThinkPad X230 (Devuan).</p><p>But that all being said, if you don&#039;t need it, I would avoid it.</p><p>I hope you figure out what to do.... <img src="http://dev1galaxy.org/img/smilies/smile.png" width="15" height="15" alt="smile" /></p><p>I suppose you could try the testing version.</p><p>Just don&#039;t use ceres. <img src="http://dev1galaxy.org/img/smilies/wink.png" width="15" height="15" alt="wink" /> That would be really, really dumb.</p>]]></description>
			<author><![CDATA[dummy@example.com (zapper)]]></author>
			<pubDate>Sat, 12 Sep 2020 02:56:05 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24581#p24581</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24577#p24577</link>
			<description><![CDATA[<p>I&#039;m using backports kernel, and the newest nVidia drivers in the .run package from nVidia&#039;s website.</p><div class="codebox"><pre><code>glenn@GamesBox ~ $ inxi -F
System:    Host: GamesBox Kernel: 5.7.0-0.bpo.2-amd64 x86_64 bits: 64 Desktop: KDE Plasma 5.14.5 
           Distro: Devuan GNU/Linux 3 (beowulf) 
Machine:   Type: Desktop Mobo: ASUSTeK model: ROG STRIX X470-F GAMING v: Rev X.0x serial: &lt;root required&gt; 
           UEFI [Legacy]: American Megatrends v: 5406 date: 11/13/2019 
CPU:       Topology: 8-Core model: AMD Ryzen 7 2700X bits: 64 type: MT MCP L2 cache: 4096 KiB 
           Speed: 1913 MHz min/max: 2200/3700 MHz Core speeds (MHz): 1: 1990 2: 1979 3: 2001 4: 2055 5: 1895 6: 1937 7: 2193 
           8: 2078 9: 1964 10: 1912 11: 2032 12: 1912 13: 2190 14: 2139 15: 2045 16: 2188 
Graphics:  Device-1: NVIDIA GP106 [GeForce GTX 1060 6GB] driver: nvidia v: 450.57 
           Display: x11 server: X.Org 1.20.4 driver: nvidia resolution: 1920x1080~60Hz 
           OpenGL: renderer: GeForce GTX 1060 6GB/PCIe/SSE2 v: 4.6.0 NVIDIA 450.57 
Audio:     Device-1: NVIDIA GP106 High Definition Audio driver: snd_hda_intel 
           Device-2: Roland EDIROL UA-25EX type: USB driver: snd-usb-audio 
           Sound Server: ALSA v: k5.7.0-0.bpo.2-amd64 
Network:   Device-1: Intel Wireless-AC 9260 driver: iwlwifi </code></pre></div><p>I get occasional lockups in my web-browser Palemoon when using facebook. <br />I suspect facebook is sucking the life out of me and my computer, but I digress.</p>]]></description>
			<author><![CDATA[dummy@example.com (GlennW)]]></author>
			<pubDate>Fri, 11 Sep 2020 22:05:17 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24577#p24577</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24572#p24572</link>
			<description><![CDATA[<p>The only time I have had problems with Devuan Beowulf so far was when I switched from OpenRC to runit... <img src="http://dev1galaxy.org/img/smilies/hmm.png" width="15" height="15" alt="hmm" /></p>]]></description>
			<author><![CDATA[dummy@example.com (zapper)]]></author>
			<pubDate>Fri, 11 Sep 2020 19:50:45 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24572#p24572</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24564#p24564</link>
			<description><![CDATA[<div class="quotebox"><cite>larsH wrote:</cite><blockquote><div><p>Have you tried the backports kernel and drivers? It is quite well known that kernel 4.19 is a bit too old to support the Ryzen 2700X, and numerous problems have been reported with it that should be solved by now.</p></div></blockquote></div><div class="codebox"><pre><code># inxi -F
System:    Host: rh050 Kernel: 5.7.0-0.bpo.2-amd64 x86_64 bits: 64 Desktop: MATE 1.20.4 Distro: Devuan GNU/Linux 3 (beowulf) 
Machine:   Type: Desktop Mobo: ASUSTeK model: PRIME X470-PRO v: Rev X.0x serial: 180529428800462 UEFI: American Megatrends 
           v: 5406 date: 11/13/2019 
CPU:       Topology: 8-Core model: AMD Ryzen 7 2700X bits: 64 type: MT MCP L2 cache: 4096 KiB 
           Speed: 1915 MHz min/max: 2200/3700 MHz Core speeds (MHz): 1: 1883 2: 1890 3: 2094 4: 2080 5: 2088 6: 2106 7: 2096 
           8: 2187 9: 1889 10: 1887 11: 1884 12: 1889 13: 1890 14: 1956 15: 2126 16: 2088 
Graphics:  Device-1: Advanced Micro Devices [AMD/ATI] Ellesmere [Radeon RX 470/480] driver: amdgpu v: kernel 
           Display: server: X.Org 1.20.4 driver: amdgpu,ati unloaded: fbdev,modesetting,vesa resolution: 2560x1440~60Hz 
           OpenGL: renderer: Radeon RX 570 Series (POLARIS10 DRM 3.37.0 5.7.0-0.bpo.2-amd64 LLVM 7.0.1) v: 4.5 Mesa 18.3.6</code></pre></div><p>Also refer to <a href="https://dev1galaxy.org/viewtopic.php?id=3616" rel="nofollow">https://dev1galaxy.org/viewtopic.php?id=3616</a></p><p>rolfie</p>]]></description>
			<author><![CDATA[dummy@example.com (rolfie)]]></author>
			<pubDate>Fri, 11 Sep 2020 17:21:42 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24564#p24564</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24559#p24559</link>
			<description><![CDATA[<p>Hi</p><p>Have you tried the backports kernel and drivers? It is quite well known that kernel 4.19 is a bit too old to support the Ryzen 2700X, and numerous problems have been reported with it that should be solved by now.</p><p>Have a nice day<br />Lars H</p>]]></description>
			<author><![CDATA[dummy@example.com (larsH)]]></author>
			<pubDate>Fri, 11 Sep 2020 07:13:48 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24559#p24559</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24557#p24557</link>
			<description><![CDATA[<p>Yup, my X470/Ryzen 7 2700X/RX580 also crashes on me from time to time. I think the RX580 is the reason:</p><div class="codebox"><pre class="vscroll"><code>Sep 10 09:43:10 rh050 kernel: [ 4404.339863] pcieport 0000:00:03.1: AER: Multiple Corrected error received: 0000:00:00.0
Sep 10 09:43:10 rh050 kernel: [ 4404.339876] pcieport 0000:00:03.1: AER: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Transmitter ID)
Sep 10 09:43:10 rh050 kernel: [ 4404.339879] pcieport 0000:00:03.1: AER:   device [1022:1453] error status/mask=00001100/00006000
Sep 10 09:43:10 rh050 kernel: [ 4404.339881] pcieport 0000:00:03.1: AER:    [ 8] Rollover              
Sep 10 09:43:10 rh050 kernel: [ 4404.339883] pcieport 0000:00:03.1: AER:    [12] Timeout               
Sep 10 09:44:38 rh050 kernel: [ 4492.979566] pcieport 0000:00:03.1: AER: Multiple Uncorrected (Non-Fatal) error received: 0000:00:00.0
Sep 10 09:44:38 rh050 kernel: [ 4492.979575] pcieport 0000:00:03.1: AER: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Receiver ID)
Sep 10 09:44:38 rh050 kernel: [ 4492.979579] pcieport 0000:00:03.1: AER:   device [1022:1453] error status/mask=00200000/04400000
Sep 10 09:44:38 rh050 kernel: [ 4492.979581] pcieport 0000:00:03.1: AER:    [21] ACSViol                (First)
Sep 10 09:44:38 rh050 kernel: [ 4492.979585] amdgpu 0000:0c:00.0: AER: can&#039;t recover (no error_detected callback)
Sep 10 09:44:38 rh050 kernel: [ 4492.979586] snd_hda_intel 0000:0c:00.1: AER: can&#039;t recover (no error_detected callback)
Sep 10 09:44:38 rh050 kernel: [ 4492.979605] pcieport 0000:00:03.1: AER: device recovery failed
Sep 10 09:44:38 rh050 kernel: [ 4492.990594] pcieport 0000:00:03.1: AER: Multiple Uncorrected (Non-Fatal) error received: 0000:00:00.0
Sep 10 09:44:38 rh050 kernel: [ 4492.990601] pcieport 0000:00:03.1: AER: PCIe Bus Error: severity=Uncorrected (Non-Fatal), type=Transaction Layer, (Receiver ID)
Sep 10 09:44:38 rh050 kernel: [ 4492.990604] pcieport 0000:00:03.1: AER:   device [1022:1453] error status/mask=00200000/04400000
Sep 10 09:44:38 rh050 kernel: [ 4492.990606] pcieport 0000:00:03.1: AER:    [21] ACSViol                (First)
Sep 10 09:44:38 rh050 kernel: [ 4492.990609] amdgpu 0000:0c:00.0: AER: can&#039;t recover (no error_detected callback)
Sep 10 09:44:38 rh050 kernel: [ 4492.990610] snd_hda_intel 0000:0c:00.1: AER: can&#039;t recover (no error_detected callback)
Sep 10 09:44:38 rh050 kernel: [ 4492.990621] pcieport 0000:00:03.1: AER: device recovery failed
Sep 10 09:44:49 rh050 kernel: [ 4503.189629] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=34669, emitted seq=34671
Sep 10 09:44:49 rh050 kernel: [ 4503.189730] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=1642, emitted seq=1643
Sep 10 09:44:49 rh050 kernel: [ 4503.189832] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:44:49 rh050 kernel: [ 4503.189846] [drm:drm_atomic_helper_wait_for_flip_done [drm_kms_helper]] *ERROR* [CRTC:47:crtc-0] flip_done timed out
Sep 10 09:44:49 rh050 kernel: [ 4503.189849] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:44:49 rh050 kernel: [ 4503.189948] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:44:49 rh050 kernel: [ 4503.189951] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:44:59 rh050 kernel: [ 4503.189953] [drm] Bailing on TDR for s_job:83f6, as another already in progress
Sep 10 09:44:59 rh050 kernel: [ 4513.429441] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [CRTC:47:crtc-0] flip_done timed out
Sep 10 09:45:09 rh050 kernel: [ 4523.669346] [drm:drm_atomic_helper_wait_for_dependencies [drm_kms_helper]] *ERROR* [PLANE:45:plane-5] flip_done timed out
Sep 10 09:45:10 rh050 kernel: [ 4524.273536] amdgpu 0000:0c:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring kiq_2.1.0 test failed (-110)
Sep 10 09:45:10 rh050 kernel: [ 4524.273619] [drm:gfx_v8_0_hw_fini [amdgpu]] *ERROR* KCQ disable failed
Sep 10 09:45:10 rh050 kernel: [ 4524.548390] cp is busy, skip halt cp
Sep 10 09:45:10 rh050 kernel: [ 4524.823113] rlc is busy, skip halt rlc
Sep 10 09:45:10 rh050 kernel: [ 4524.824133] amdgpu 0000:0c:00.0: GPU BACO reset
Sep 10 09:45:11 rh050 kernel: [ 4525.125145] amdgpu 0000:0c:00.0: GPU reset succeeded, trying to resume
Sep 10 09:45:11 rh050 kernel: [ 4525.126894] [drm] PCIE GART of 256M enabled (table at 0x000000F400300000).
Sep 10 09:45:11 rh050 kernel: [ 4525.126906] [drm] VRAM is lost due to GPU reset!
Sep 10 09:45:11 rh050 kernel: [ 4525.453490] amdgpu 0000:0c:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring gfx test failed (-110)
Sep 10 09:45:11 rh050 kernel: [ 4525.453561] [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block &lt;gfx_v8_0&gt; failed -110
Sep 10 09:45:11 rh050 kernel: [ 4525.453593] amdgpu 0000:0c:00.0: GPU reset(1) failed
Sep 10 09:45:11 rh050 kernel: [ 4525.453597] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453599] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453603] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453604] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453607] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453608] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453609] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453613] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453615] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453616] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453618] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453619] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453620] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453620] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453621] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453623] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453628] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453630] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453631] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453636] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453638] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453642] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453643] amdgpu 0000:0c:00.0: GPU reset end with ret = -110
Sep 10 09:45:11 rh050 kernel: [ 4525.453650] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453654] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453658] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453659] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453662] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453667] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453673] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453676] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453680] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453682] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453685] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453686] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453687] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453688] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453689] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453690] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453691] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453692] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453693] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453695] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453696] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453698] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453699] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453700] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453702] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453704] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453705] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453706] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453706] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453707] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453708] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453710] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453711] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453712] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453714] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453715] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453716] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453718] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453719] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453720] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453721] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453722] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453723] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453724] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453726] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453727] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453728] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453730] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453731] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453732] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453735] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453736] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453737] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453738] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453739] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453741] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453742] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453742] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453743] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453745] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453747] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453748] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453749] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453750] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453751] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453754] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453755] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453756] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453758] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453758] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453759] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453760] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453762] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453763] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453765] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453766] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453767] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453768] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453771] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453774] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453775] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453777] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453781] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453784] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453787] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.453791] [drm] Skip scheduling IBs!
Sep 10 09:45:11 rh050 kernel: [ 4525.454036] [drm] scheduler sdma0 is not ready, skipping
Sep 10 09:45:11 rh050 kernel: [ 4525.454037] [drm] scheduler sdma1 is not ready, skipping
Sep 10 09:45:11 rh050 kernel: [ 4525.454114] [drm:amdgpu_gem_va_ioctl [amdgpu]] *ERROR* Couldn&#039;t update BO_VA (-2)
Sep 10 09:45:11 rh050 kernel: [ 4525.454145] BUG: kernel NULL pointer dereference, address: 0000000000000008
Sep 10 09:45:11 rh050 kernel: [ 4525.454147] #PF: supervisor read access in kernel mode
Sep 10 09:45:11 rh050 kernel: [ 4525.454148] #PF: error_code(0x0000) - not-present page
Sep 10 09:45:11 rh050 kernel: [ 4525.454150] PGD 0 P4D 0 
Sep 10 09:45:11 rh050 kernel: [ 4525.454153] Oops: 0000 [#1] SMP NOPTI
Sep 10 09:45:11 rh050 kernel: [ 4525.454156] CPU: 9 PID: 4151 Comm: Xorg Tainted: G           OE     5.7.0-0.bpo.2-amd64 #1 Debian 5.7.10-1~bpo10+1
Sep 10 09:45:11 rh050 kernel: [ 4525.454157] Hardware name: System manufacturer System Product Name/PRIME X470-PRO, BIOS 5406 11/13/2019
Sep 10 09:45:11 rh050 kernel: [ 4525.454234] RIP: 0010:amdgpu_vm_sdma_commit+0x50/0x1f0 [amdgpu]
Sep 10 09:45:11 rh050 kernel: [ 4525.454237] Code: 8b 47 20 80 7f 10 00 48 8b a8 88 01 00 00 48 8b 47 08 4c 8d a0 e0 00 00 00 75 07 4c 8d a0 98 01 00 00 49 8b 44 24 10 8b 55 08 &lt;48&gt; 8b 40 08 48 8d 78 88 85 d2 0f 84 42 01 00 00 48 8b 40 90 48 89
Sep 10 09:45:11 rh050 kernel: [ 4525.454239] RSP: 0018:ffffaeb301cebb50 EFLAGS: 00010246
Sep 10 09:45:11 rh050 kernel: [ 4525.454240] RAX: 0000000000000000 RBX: ffffaeb301cebba0 RCX: 0000000000108000
Sep 10 09:45:11 rh050 kernel: [ 4525.454242] RDX: 0000000000000080 RSI: ffffaeb301cebc48 RDI: ffffaeb301cebba0
Sep 10 09:45:11 rh050 kernel: [ 4525.454243] RBP: ffff9da25c513de8 R08: ffff9da22b80e6c8 R09: 0000000000000000
Sep 10 09:45:11 rh050 kernel: [ 4525.454244] R10: 000000000000007d R11: 0000000000000000 R12: ffff9da2727d8198
Sep 10 09:45:11 rh050 kernel: [ 4525.454245] R13: ffffaeb301cebc48 R14: 0000000000107600 R15: 0000000000000002
Sep 10 09:45:11 rh050 kernel: [ 4525.454247] FS:  00007fc5e6855a80(0000) GS:ffff9da27ea40000(0000) knlGS:0000000000000000
Sep 10 09:45:11 rh050 kernel: [ 4525.454248] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 10 09:45:11 rh050 kernel: [ 4525.454249] CR2: 0000000000000008 CR3: 00000007f5370000 CR4: 00000000003406e0
Sep 10 09:45:11 rh050 kernel: [ 4525.454250] Call Trace:
Sep 10 09:45:11 rh050 kernel: [ 4525.454327]  amdgpu_vm_bo_update_mapping+0x1d4/0x200 [amdgpu]
Sep 10 09:45:11 rh050 kernel: [ 4525.454403]  amdgpu_vm_clear_freed+0xe8/0x230 [amdgpu]
Sep 10 09:45:11 rh050 kernel: [ 4525.454477]  amdgpu_gem_va_ioctl+0x3c4/0x4d0 [amdgpu]
Sep 10 09:45:11 rh050 kernel: [ 4525.454551]  ? amdgpu_gem_va_map_flags+0x60/0x60 [amdgpu]
Sep 10 09:45:11 rh050 kernel: [ 4525.454567]  drm_ioctl_kernel+0xac/0xf0 [drm]
Sep 10 09:45:11 rh050 kernel: [ 4525.454583]  drm_ioctl+0x201/0x3a0 [drm]
Sep 10 09:45:11 rh050 kernel: [ 4525.454655]  ? amdgpu_gem_va_map_flags+0x60/0x60 [amdgpu]
Sep 10 09:45:11 rh050 kernel: [ 4525.454725]  amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
Sep 10 09:45:11 rh050 kernel: [ 4525.454730]  ksys_ioctl+0x86/0xc0
Sep 10 09:45:11 rh050 kernel: [ 4525.454732]  __x64_sys_ioctl+0x16/0x20
Sep 10 09:45:11 rh050 kernel: [ 4525.454736]  do_syscall_64+0x52/0x170
Sep 10 09:45:11 rh050 kernel: [ 4525.454740]  entry_SYSCALL_64_after_hwframe+0x44/0xa9
Sep 10 09:45:11 rh050 kernel: [ 4525.454742] RIP: 0033:0x7fc5e6f6f427
Sep 10 09:45:11 rh050 kernel: [ 4525.454744] Code: 00 00 90 48 8b 05 69 aa 0c 00 64 c7 00 26 00 00 00 48 c7 c0 ff ff ff ff c3 66 2e 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 &lt;48&gt; 3d 01 f0 ff ff 73 01 c3 48 8b 0d 39 aa 0c 00 f7 d8 64 89 01 48
Sep 10 09:45:11 rh050 kernel: [ 4525.454745] RSP: 002b:00007ffe16824678 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Sep 10 09:45:11 rh050 kernel: [ 4525.454747] RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007fc5e6f6f427
Sep 10 09:45:11 rh050 kernel: [ 4525.454748] RDX: 00007ffe168246c0 RSI: 00000000c0286448 RDI: 000000000000000e
Sep 10 09:45:11 rh050 kernel: [ 4525.454749] RBP: 00007ffe168246c0 R08: 0000000107600000 R09: 000000000000000e
Sep 10 09:45:11 rh050 kernel: [ 4525.454750] R10: 0000000000000044 R11: 0000000000000246 R12: 00000000c0286448
Sep 10 09:45:11 rh050 kernel: [ 4525.454751] R13: 000000000000000e R14: 0000000000000002 R15: 000056228fa96960
Sep 10 09:45:11 rh050 kernel: [ 4525.454753] Modules linked in: rpcsec_gss_krb5 nfsv4 dns_resolver bnep bluetooth drbg ansi_cprng ecdh_generic ecc vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) parport_pc ppdev lp parport nfsd auth_rpcgss nfs_acl nfs lockd grace fscache sunrpc efivarfs fuse nls_ascii nls_cp437 vfat fat pktcdvd snd_hda_codec_realtek edac_mce_amd amdgpu snd_hda_codec_generic ledtrig_audio gpu_sched snd_hda_codec_hdmi kvm_amd ttm snd_hda_intel drm_kms_helper snd_intel_dspcfg ftdi_sio joydev eeepc_wmi cec asus_wmi kvm usbserial evdev snd_hda_codec drm battery snd_hda_core sparse_keymap irqbypass rfkill snd_hwdep video snd_pcm wmi_bmof efi_pstore pcspkr snd_timer efivars ccp snd sp5100_tco mxm_wmi k10temp watchdog mfd_core soundcore rng_core button acpi_cpufreq ext4 crc16 mbcache jbd2 crc32c_generic algif_skcipher af_alg dm_crypt dm_mod sr_mod cdrom uas usb_storage sg hid_generic usbhid hid sd_mod crc32_pclmul crc32c_intel ghash_clmulni_intel ohci_pci aesni_intel ahci libaes libahci xhci_pci crypto_simd xhci_hcd
Sep 10 09:45:11 rh050 kernel: [ 4525.454790]  ohci_hcd ehci_pci libata ehci_hcd igb cryptd glue_helper scsi_mod nvme i2c_piix4 usbcore i2c_algo_bit nvme_core dca ptp pps_core t10_pi crc_t10dif crct10dif_generic crct10dif_pclmul usb_common crct10dif_common wmi gpio_amdpt gpio_generic
Sep 10 09:45:11 rh050 kernel: [ 4525.454802] CR2: 0000000000000008
Sep 10 09:45:11 rh050 kernel: [ 4525.454804] ---[ end trace f9443a0822a20086 ]---
Sep 10 09:45:11 rh050 kernel: [ 4525.676451] RIP: 0010:amdgpu_vm_sdma_commit+0x50/0x1f0 [amdgpu]
Sep 10 09:45:11 rh050 kernel: [ 4525.676455] Code: 8b 47 20 80 7f 10 00 48 8b a8 88 01 00 00 48 8b 47 08 4c 8d a0 e0 00 00 00 75 07 4c 8d a0 98 01 00 00 49 8b 44 24 10 8b 55 08 &lt;48&gt; 8b 40 08 48 8d 78 88 85 d2 0f 84 42 01 00 00 48 8b 40 90 48 89
Sep 10 09:45:11 rh050 kernel: [ 4525.676456] RSP: 0018:ffffaeb301cebb50 EFLAGS: 00010246
Sep 10 09:45:11 rh050 kernel: [ 4525.676458] RAX: 0000000000000000 RBX: ffffaeb301cebba0 RCX: 0000000000108000
Sep 10 09:45:11 rh050 kernel: [ 4525.676459] RDX: 0000000000000080 RSI: ffffaeb301cebc48 RDI: ffffaeb301cebba0
Sep 10 09:45:11 rh050 kernel: [ 4525.676460] RBP: ffff9da25c513de8 R08: ffff9da22b80e6c8 R09: 0000000000000000
Sep 10 09:45:11 rh050 kernel: [ 4525.676461] R10: 000000000000007d R11: 0000000000000000 R12: ffff9da2727d8198
Sep 10 09:45:11 rh050 kernel: [ 4525.676462] R13: ffffaeb301cebc48 R14: 0000000000107600 R15: 0000000000000002
Sep 10 09:45:11 rh050 kernel: [ 4525.676464] FS:  00007fc5e6855a80(0000) GS:ffff9da27ea40000(0000) knlGS:0000000000000000
Sep 10 09:45:11 rh050 kernel: [ 4525.676465] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 10 09:45:11 rh050 kernel: [ 4525.676466] CR2: 0000000000000008 CR3: 00000007f5370000 CR4: 00000000003406e0
Sep 10 09:45:13 rh050 hddtemp[4090]: /dev/sda: CT500MX500SSD1: 33 C
Sep 10 09:45:13 rh050 hddtemp[4090]: /dev/sdb: Crucial_CT1050MX300SSD1: 26 C
Sep 10 09:45:13 rh050 hddtemp[4090]: /dev/sdc: Crucial_CT512MX100SSD1: 31 C
Sep 10 09:45:13 rh050 hddtemp[4090]: /dev/sde: CT2000MX500SSD1: 33 C
Sep 10 09:45:13 rh050 hddtemp[4090]: /dev/sdf: WDC WD4001FFSX-68JNUN0: 39 C
Sep 10 09:45:21 rh050 kernel: [ 4535.701511] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=1643, emitted seq=1643
Sep 10 09:45:21 rh050 kernel: [ 4535.701602] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma0 timeout, signaled seq=34671, emitted seq=34671
Sep 10 09:45:21 rh050 kernel: [ 4535.701692] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process  pid 0 thread  pid 0
Sep 10 09:45:21 rh050 kernel: [ 4535.701783] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:45:21 rh050 kernel: [ 4535.701784] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:45:21 rh050 kernel: [ 4535.701790] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:45:31 rh050 kernel: [ 4535.701792] [drm] Bailing on TDR for s_job:4cd, as another already in progress
Sep 10 09:45:31 rh050 kernel: [ 4545.941531] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=1643, emitted seq=1643
Sep 10 09:45:31 rh050 kernel: [ 4545.941622] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:45:31 rh050 kernel: [ 4545.941630] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:45:42 rh050 kernel: [ 4545.941632] [drm] Bailing on TDR for s_job:4ce, as another already in progress
Sep 10 09:45:42 rh050 kernel: [ 4556.181492] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=1643, emitted seq=1643
Sep 10 09:45:42 rh050 kernel: [ 4556.181583] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:45:42 rh050 kernel: [ 4556.181591] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:45:52 rh050 kernel: [ 4556.181594] [drm] Bailing on TDR for s_job:4cf, as another already in progress
Sep 10 09:45:52 rh050 kernel: [ 4566.421393] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=1643, emitted seq=1643
Sep 10 09:45:52 rh050 kernel: [ 4566.421485] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:45:52 rh050 kernel: [ 4566.421493] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:46:02 rh050 kernel: [ 4566.421495] [drm] Bailing on TDR for s_job:4d0, as another already in progress
Sep 10 09:46:02 rh050 kernel: [ 4576.661299] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=1643, emitted seq=1643
Sep 10 09:46:02 rh050 kernel: [ 4576.661390] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:46:02 rh050 kernel: [ 4576.661398] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:46:12 rh050 kernel: [ 4576.661401] [drm] Bailing on TDR for s_job:4d1, as another already in progress
Sep 10 09:46:12 rh050 kernel: [ 4586.901180] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=1643, emitted seq=1643
Sep 10 09:46:12 rh050 kernel: [ 4586.901272] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:46:12 rh050 kernel: [ 4586.901280] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:46:23 rh050 kernel: [ 4586.901283] [drm] Bailing on TDR for s_job:4d2, as another already in progress
Sep 10 09:46:23 rh050 kernel: [ 4597.141105] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=1643, emitted seq=1643
Sep 10 09:46:23 rh050 kernel: [ 4597.141197] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:46:23 rh050 kernel: [ 4597.141204] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:46:33 rh050 kernel: [ 4597.141207] [drm] Bailing on TDR for s_job:4d3, as another already in progress
Sep 10 09:46:33 rh050 kernel: [ 4607.381004] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=1643, emitted seq=1643
Sep 10 09:46:33 rh050 kernel: [ 4607.381097] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:46:33 rh050 kernel: [ 4607.381104] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:46:43 rh050 kernel: [ 4607.381106] [drm] Bailing on TDR for s_job:4d4, as another already in progress
Sep 10 09:46:43 rh050 kernel: [ 4617.620748] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=1643, emitted seq=1643
Sep 10 09:46:43 rh050 kernel: [ 4617.620840] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:46:43 rh050 kernel: [ 4617.620847] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 09:46:53 rh050 kernel: [ 4617.620850] [drm] Bailing on TDR for s_job:4d5, as another already in progress
Sep 10 09:46:53 rh050 kernel: [ 4627.860707] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=1643, emitted seq=1643
Sep 10 09:46:53 rh050 kernel: [ 4627.860799] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 4151 thread Xorg:cs0 pid 4375
Sep 10 09:46:53 rh050 kernel: [ 4627.860807] amdgpu 0000:0c:00.0: GPU reset begin!
Sep 10 10:00:13 rh050 hddtemp[4090]: /dev/sda: CT500MX500SSD1: 33 C
Sep 10 10:00:13 rh050 hddtemp[4090]: /dev/sdb: Crucial_CT1050MX300SSD1: 26 C
Sep 10 10:00:13 rh050 hddtemp[4090]: /dev/sdc: Crucial_CT512MX100SSD1: 39 C
Sep 10 10:00:13 rh050 hddtemp[4090]: /dev/sde: CT2000MX500SSD1: 33 C
Sep 10 10:00:13 rh050 hddtemp[4090]: /dev/sdf: WDC WD4001FFSX-68JNUN0: 39 C</code></pre></div><p>I had the same issue before with ASCII. It does not happen every day, more like once a month, but it&#039;s not nice. It looks like background processes such as hddtemp are still working, but the computer isn&#039;t usable any more: no reaction to mouse or keyboard.</p><p>I don&#039;t blame Beowulf; it&#039;s a HW problem.</p><p>rolfie</p>]]></description>
			<author><![CDATA[dummy@example.com (rolfie)]]></author>
			<pubDate>Thu, 10 Sep 2020 20:44:46 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24557#p24557</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24556#p24556</link>
			<description><![CDATA[<div class="quotebox"><cite>erdos wrote:</cite><blockquote><div><p>I use the Nvidia proprietary driver, installed through &#039;Synaptic&#039;, on this computer.&#160; I believe it&#039;s a 9300 series card from Nvidia.</p></div></blockquote></div><p>Perhaps you could try the open-source nouveau driver, just to help narrow things down. I gave up on Nvidia when my last Nvidia card died a few years ago, but there seems to be no end of glitches, trouble, version mismatches and whatnot with the proprietary drivers. Hopefully an Nvidia expert will chime in...</p>]]></description>
			<author><![CDATA[dummy@example.com (sgage)]]></author>
			<pubDate>Thu, 10 Sep 2020 18:58:26 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24556#p24556</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24554#p24554</link>
			<description><![CDATA[<p>I use the Nvidia proprietary driver, installed through &#039;Synaptic&#039;, on this computer.&#160; I believe it&#039;s a 9300 series card from Nvidia.</p>]]></description>
			<author><![CDATA[dummy@example.com (erdos)]]></author>
			<pubDate>Thu, 10 Sep 2020 17:16:31 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24554#p24554</guid>
		</item>
		<item>
			<title><![CDATA[Re: Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24553#p24553</link>
			<description><![CDATA[<div class="quotebox"><cite>erdos wrote:</cite><blockquote><div><p>Hi,&#160; it seems that Beowulf is crashing often on my computer.&#160; Typically the screen freezes and the keyboard becomes unresponsive.&#160; I use Beowulf on my HTPC, and usually Firefox and Kodi are open on the desktop.&#160; It&#039;s installed on an HP computer with an Nvidia card and a solid-state drive.</p><p>Does anyone else have Beowulf crashing issues?</p></div></blockquote></div><p>No such problems here. In fact, Beowulf has been totally reliable on my system.&#160; I, too, have an HP desktop with an SSD, but I run just the integrated Intel graphics. It wouldn&#039;t surprise me if your issue were Nvidia-related. Are you using nouveau or the proprietary drivers?</p>]]></description>
			<author><![CDATA[dummy@example.com (sgage)]]></author>
			<pubDate>Thu, 10 Sep 2020 16:33:22 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24553#p24553</guid>
		</item>
		<item>
			<title><![CDATA[Beowulf crashes often]]></title>
			<link>http://dev1galaxy.org/viewtopic.php?pid=24552#p24552</link>
			<description><![CDATA[<p>Hi,&#160; it seems that Beowulf is crashing often on my computer.&#160; Typically the screen freezes and the keyboard becomes unresponsive.&#160; I use Beowulf on my HTPC, and usually Firefox and Kodi are open on the desktop.&#160; It&#039;s installed on an HP computer with an Nvidia card and a solid-state drive.</p><p>Does anyone else have Beowulf crashing issues?</p>]]></description>
			<author><![CDATA[dummy@example.com (erdos)]]></author>
			<pubDate>Thu, 10 Sep 2020 16:09:36 +0000</pubDate>
			<guid>http://dev1galaxy.org/viewtopic.php?pid=24552#p24552</guid>
		</item>
	</channel>
</rss>
