[ofa-general] [Bug 485] New: creating & deleting a subinterface with a bad pkey crashes the kernel: NULL pointer dereference
bugzilla-daemon at lists.openfabrics.org
Thu Mar 22 09:49:18 PDT 2007
https://bugs.openfabrics.org/show_bug.cgi?id=485
Summary: creating & deleting a subinterface with a bad pkey
crashes the kernel: NULL pointer dereference
Product: OpenFabrics Linux
Version: 1.2beta1
Platform: X86
OS/Version: RHEL 4
Status: NEW
Severity: normal
Priority: P1
Component: IB Core
AssignedTo: bugzilla at openib.org
ReportedBy: Philippe.Gregoire at cea.fr
Creating and then deleting a subinterface for a pkey whose partition
does not include the node causes a kernel panic:
Mar 22 18:19:07 cors118 kernel: Unable to handle kernel NULL pointer
dereference at virtual address 00000054
Mar 22 18:19:07 cors118 kernel: printing eip:
Mar 22 18:19:07 cors118 kernel: c02d3325
Mar 22 18:19:07 cors118 kernel: *pde = 2cbd1001
Mar 22 18:19:07 cors118 kernel: Oops: 0000 [#1]
Mar 22 18:19:07 cors118 kernel: SMP
Mar 22 18:19:07 cors118 kernel: Modules linked in: parport_pc lp parport
autofs4 i2c_dev i2c_core nfs lockd nfs_acl sunrpc rdma_ucm(U) ib_sdp(U)
rdma_cm(U) iw_cm(U) ib_addr(U) ib_local_sa(U) ib_ipoib(U) md5 ipv6 ide_dump
cciss_dump scsi_dump diskdump zlib_deflate dm_mirror dm_mod button battery ac
ohci_hcd ib_mthca(U) ib_umad(U) ib_ucm(U) ib_uverbs(U) ib_cm(U) ib_sa(U)
ib_mad(U) ib_core(U) tg3 floppy ext3 jbd lpfc scsi_transport_fc cciss sd_mod
scsi_mod
Mar 22 18:19:07 cors118 kernel: CPU: 3
Mar 22 18:19:07 cors118 kernel: EIP: 0060:[<c02d3325>] Not tainted VLI
Mar 22 18:19:07 cors118 kernel: EFLAGS: 00010046 (2.6.9-42.ELsmp)
Mar 22 18:19:07 cors118 kernel: EIP is at _spin_lock_irqsave+0x7/0x45
Mar 22 18:19:07 cors118 kernel: eax: 00000050 ebx: 00000246 ecx: e99fd200
edx: 00000000
Mar 22 18:19:07 cors118 kernel: esi: 00000050 edi: 00000050 ebp: 00000000
esp: f368ce18
Mar 22 18:19:07 cors118 kernel: ds: 007b es: 007b ss: 0068
Mar 22 18:19:07 cors118 kernel: Process bash (pid: 4452, threadinfo=f368c000
task=f10f9830)
Mar 22 18:19:07 cors118 kernel: Stack: 00000000 e99fd240 f88edb33 e99fd000
e99fd240 e99fd240 00008020 f89dfefe
Mar 22 18:19:07 cors118 kernel: e99fd000 e99fd240 e99fd240 f89dc176
00000001 e99fd000 f736aaf0 ea1d5000
Mar 22 18:19:07 cors118 kernel: c016691a 00000001 00000001 f89e8338
f89e8338 00000058 f4be0000 f368ce98
Mar 22 18:19:07 cors118 kernel: Call Trace:
Mar 22 18:19:07 cors118 kernel: [<f88edb33>] cm_destroy_id+0x12/0x1a5 [ib_cm]
Mar 22 18:19:07 cors118 kernel: [<f89dfefe>] ipoib_cm_dev_stop+0x23/0xae
[ib_ipoib]
Mar 22 18:19:07 cors118 kernel: [<f89dc176>] ipoib_ib_dev_stop+0x28/0x33c
[ib_ipoib]
Mar 22 18:19:07 cors118 kernel: [<c016691a>] __link_path_walk+0x133/0xbb5
Mar 22 18:19:07 cors118 kernel: [<c0130e1b>] flush_cpu_workqueue+0x14b/0x153
Mar 22 18:19:07 cors118 kernel: [<c01204f5>] autoremove_wake_function+0x0/0x2d
Mar 22 18:19:07 cors118 kernel: [<c01204f5>] autoremove_wake_function+0x0/0x2d
Mar 22 18:19:07 cors118 kernel: [<c01204f5>] autoremove_wake_function+0x0/0x2d
Mar 22 18:19:07 cors118 kernel: [<f89d9637>] ipoib_flush_paths+0x11a/0x122
[ib_ipoib]
Mar 22 18:19:07 cors118 kernel: [<f89d916d>] ipoib_stop+0x58/0xf8 [ib_ipoib]
Mar 22 18:19:07 cors118 kernel: [<c0280043>] dev_close+0x57/0x77
Mar 22 18:19:07 cors118 kernel: [<c02823fc>] unregister_netdevice+0x94/0x1fa
Mar 22 18:19:07 cors118 kernel: [<c022c7ea>] unregister_netdev+0xf/0x15
Mar 22 18:19:07 cors118 kernel: [<f89debd1>] ipoib_vlan_delete+0x30/0xfa
[ib_ipoib]
Mar 22 18:19:07 cors118 kernel: [<f89daf3c>] delete_child+0x39/0x46 [ib_ipoib]
Mar 22 18:19:07 cors118 kernel: [<f89daf03>] delete_child+0x0/0x46 [ib_ipoib]
Mar 22 18:19:07 cors118 kernel: [<c02202df>] class_device_attr_store+0x19/0x21
Mar 22 18:19:07 cors118 kernel: [<c018e1a7>] flush_write_buffer+0x20/0x25
Mar 22 18:19:07 cors118 kernel: [<c018e203>] sysfs_write_file+0x57/0x7c
Mar 22 18:19:07 cors118 kernel: [<c015b0bc>] vfs_write+0xb6/0xe2
Mar 22 18:19:07 cors118 kernel: [<c015b186>] sys_write+0x3c/0x62
Mar 22 18:19:07 cors118 kernel: [<c02d4703>] syscall_call+0x7/0xb
Mar 22 18:19:07 cors118 kernel: Code: 6c 00 0c 60 2e c0 0f b6 02 84 c0 7e 08 0f
0b 6d 00 0c 60 2e c0 86 0a c3 f0 81 00 00 00 00 01 c3 f0 ff 00 c3 56 89 c6 53
9c 5b fa <81> 78 04 ad 4e ad de 74 18 ff 74 24 08 68 a1 6f 2e c0 e8 62 f5
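The faulting instruction can be cross-checked against the registers in the oops. The sketch below is not part of the original report: it hand-decodes the seven bytes starting at the <81> marker in the Code: line and recomputes the address that was accessed. The function name is made up for illustration. Reading 0xdead4ead as the spinlock debug magic of this kernel generation, and concluding that the lock lives at offset 0x50 of a structure reached through a NULL pointer, are interpretations rather than statements from the reporter.

```python
def decode_group1_disp8(insn):
    """Hand-decode an x86 'opcode 0x81, ModRM with disp8, imm32' instruction.

    Hypothetical helper for this one encoding only; not a general disassembler.
    """
    opcode, modrm, disp8 = insn[0], insn[1], insn[2]
    if opcode != 0x81:
        raise ValueError("expected opcode 0x81")
    mod = modrm >> 6          # 1 means: base register + 8-bit displacement
    op = (modrm >> 3) & 7     # 7 selects CMP within opcode group 1
    rm = modrm & 7            # 0 selects EAX as the base register
    if mod != 1:
        raise ValueError("expected a disp8 addressing form")
    disp = disp8 - 256 if disp8 > 127 else disp8   # disp8 is signed
    imm = int.from_bytes(insn[3:7], "little")      # 32-bit immediate
    return {"op": op, "base_reg": rm, "disp": disp, "imm": imm}


if __name__ == "__main__":
    # Bytes at EIP, copied from the Code: line of the oops.
    insn = bytes.fromhex("81 78 04 ad 4e ad de")
    decoded = decode_group1_disp8(insn)
    eax = 0x50                # eax value from the register dump
    print("accessed address:", hex(eax + decoded["disp"]))
    print("compared against:", hex(decoded["imm"]))
```

With eax = 0x50 and a displacement of 4, the accessed address is 0x54, which matches the "virtual address 00000054" line. That is consistent with _spin_lock_irqsave being handed a lock pointer of 0x50 by cm_destroy_id.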
SYSTEM INFORMATION:
[root at cors118 ~]# uname -a
Linux cors118 2.6.9-42.ELsmp #1 SMP Wed Jul 12 23:27:17 EDT 2006 i686 i686 i386
GNU/Linux
[root at cors118 ~]# cat /etc/redhat-release
Red Hat Enterprise Linux WS release 4 (Nahant Update 4)
[root at cors118 ~]# rpm -qa kernel-ib
kernel-ib-1.2-2.6.9_42.ELsmp
[root at cors118 ~]# lspci | grep Mella
03:01.0 PCI bridge: Mellanox Technologies MT23108 PCI Bridge (rev a1)
04:00.0 InfiniBand: Mellanox Technologies MT23108 InfiniHost (rev a1)
PARTITIONS CONFIGURED BY THE CISCO SUBNET MANAGER:
[root at cors118 ib0]# grep -v 0x0000 /sys/class/infiniband/mthca0/ports/1/pkeys/*
/sys/class/infiniband/mthca0/ports/1/pkeys/0:0xffff
/sys/class/infiniband/mthca0/ports/1/pkeys/1:0x8001
/sys/class/infiniband/mthca0/ports/1/pkeys/2:0x8002
/sys/class/infiniband/mthca0/ports/1/pkeys/3:0x8010
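Whether a pkey is "bad" in the sense of this report can be checked against the table above before creating a child interface. The following is a minimal sketch, not something from the original report. The function names are made up, and two assumptions are built in: that a 0x0000 entry is an unused slot (the grep above filters those out), and that two pkeys refer to the same partition when their low 15 bits match, the top bit being the membership flag.

```python
import os

# Assumption: low 15 bits identify the partition, top bit is the membership flag.
PKEY_BASE_MASK = 0x7FFF


def read_pkey_table(pkey_dir):
    """Return the non-zero P_Key values found in a sysfs pkeys directory.

    pkey_dir would be e.g. /sys/class/infiniband/mthca0/ports/1/pkeys,
    where each file is named by its table index and holds one hex value.
    """
    values = []
    for name in sorted(os.listdir(pkey_dir), key=int):
        with open(os.path.join(pkey_dir, name)) as f:
            value = int(f.read().strip(), 16)
        if value != 0:  # assumed to be an unused table slot
            values.append(value)
    return values


def pkey_in_table(pkey, table):
    """True if some table entry names the same partition as pkey."""
    wanted = pkey & PKEY_BASE_MASK
    return any((entry & PKEY_BASE_MASK) == wanted for entry in table)


if __name__ == "__main__":
    # Table values copied from the listing in this report.
    table = [0xFFFF, 0x8001, 0x8002, 0x8010]
    for pkey in (0x8001, 0x8020):
        print(hex(pkey), pkey_in_table(pkey, table))
```

Against the table in this report, 0x8001 is present and 0x8020 is not. These are the two pkeys used in the procedure.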
PROCEDURE
[root at cors118 ib0]# dmesg | grep ib0
divert: not allocating divert_blk for non-ethernet device ib0
[root at cors118 ib0]# pwd
/sys/class/net/ib0
[root at cors118 ib0]# echo 0x8001 > create_child
[root at cors118 ib0]# ls ..
eth0 eth1 ib0 ib0.8001 ib1 lo sit0
[root at cors118 ib0]# dmesg | grep ib0
divert: not allocating divert_blk for non-ethernet device ib0
divert: not allocating divert_blk for non-ethernet device ib0.8001
[root at cors118 ib0]# echo 0x8001 > delete_child
[root at cors118 ib0]# dmesg | grep ib0
divert: not allocating divert_blk for non-ethernet device ib0
divert: not allocating divert_blk for non-ethernet device ib0.8001
divert: no divert_blk to free, ib0.8001 not ethernet
[root at cors118 ib0]# echo 0x8020 > create_child
[root at cors118 ib0]# dmesg | grep ib0
divert: not allocating divert_blk for non-ethernet device ib0
divert: not allocating divert_blk for non-ethernet device ib0.8001
divert: no divert_blk to free, ib0.8001 not ethernet
divert: not allocating divert_blk for non-ethernet device ib0.8020
[root at cors118 ib0]# ls ..
eth0 eth1 ib0 ib0.8020 ib1 lo sit0
[root at cors118 ib0]# echo 0x8020 > delete_child
.... and the kernel panics with the oops shown above.
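For anyone who wants to repeat the procedure with another pkey or parent interface, the two sysfs writes can be generated rather than typed. The sketch below is an illustration with made-up function names. It only builds the command strings and never writes to sysfs, since running the commands on an affected system is expected to panic it. The child naming pattern (parent name, a dot, then the pkey as four hex digits) is inferred from the ib0.8001 and ib0.8020 names in the transcript.

```python
def child_name(parent, pkey):
    """Child interface name as it appears in the transcript, e.g. ib0.8020."""
    return "%s.%04x" % (parent, pkey)


def repro_steps(parent, pkey, sysfs_root="/sys/class/net"):
    """Return (path, value) pairs for the create/delete sequence."""
    base = "%s/%s" % (sysfs_root, parent)
    value = "0x%04x" % pkey
    return [
        (base + "/create_child", value),
        (base + "/delete_child", value),
    ]


def as_shell(steps):
    """Render the steps as the echo commands used in the report."""
    return ["echo %s > %s" % (value, path) for path, value in steps]


if __name__ == "__main__":
    # Dry run only: prints the commands, does not execute them.
    print("child interface:", child_name("ib0", 0x8020))
    for line in as_shell(repro_steps("ib0", 0x8020)):
        print(line)
```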