[ofa-general] mlx4_core CQ overrun
Roland Dreier
roland.list at gmail.com
Thu Jun 12 17:47:07 PDT 2008
> Has anybody seen mlx4_core CQ overrun before? The test is based on OFED-1.3. FW
> version is 2.3.0. Please let me know if any more info is needed.
Yes, I've seen CQ overrun -- when a CQ is overrun...
> c955mgrs1:~ # dsh -av "grep 'CQ overrun' /var/log/messages" | sort
> dsh: c955c2s1.ppd.pok.ibm.com Host is not responding. No command will be issued to this host
> c955c1s11.ppd.pok.ibm.com: Jun 10 07:18:15 c955c1s11 kernel: mlx4_core 0003:01:00.0: CQ overrun on CQN 000098
What test are you running to get this error? My first guess would be
a bug in the test that overruns a CQ.
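
For illustration only, here is a minimal sketch of the kind of bookkeeping a
test can do to avoid overrunning a CQ. It is a toy model in Python, not code
from the test in question and not part of libibverbs or mlx4: the class name
CqCreditTracker and its methods are hypothetical. The assumption is that the
test never allows more signaled work requests to be outstanding than the
number of entries it asked for when it created the CQ.

```python
class CqCreditTracker:
    """Toy model of CQ capacity bookkeeping (hypothetical helper).

    cq_depth stands for the number of entries requested when the CQ was
    created; outstanding counts signaled work requests that were posted
    but whose completions have not yet been polled.
    """

    def __init__(self, cq_depth):
        if cq_depth <= 0:
            raise ValueError("cq_depth must be positive")
        self.cq_depth = cq_depth
        self.outstanding = 0

    def can_post(self, n=1):
        # Posting is safe only if every resulting completion has a free slot.
        return self.outstanding + n <= self.cq_depth

    def post(self, n=1):
        # Refuse to post rather than risk generating more completions
        # than the CQ can hold.
        if not self.can_post(n):
            raise RuntimeError("posting %d more would overrun the CQ" % n)
        self.outstanding += n

    def polled(self, n):
        # Call after draining n completions from the CQ.
        if n < 0 or n > self.outstanding:
            raise ValueError("polled more completions than were outstanding")
        self.outstanding -= n


if __name__ == "__main__":
    tracker = CqCreditTracker(cq_depth=4)
    tracker.post(4)                 # CQ is now fully committed
    print(tracker.can_post())       # no free slot until something is polled
    tracker.polled(2)               # two completions drained
    print(tracker.can_post(2))      # two slots are free again
```

A test that posts without a check like can_post(), or that shares one CQ
between several QPs without counting all of them, is the sort of bug I would
look for first.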
- R.