[ofa-general] [Bug 485] New: creating & deleting a subinterface with a bad pkey crashes the kernel: NULL pointer dereference

bugzilla-daemon at lists.openfabrics.org bugzilla-daemon at lists.openfabrics.org
Thu Mar 22 09:49:18 PDT 2007


https://bugs.openfabrics.org/show_bug.cgi?id=485

           Summary: creating & deleting a subinterface with a bad pkey
                    crashes the kernel: NULL pointer dereference
           Product: OpenFabrics Linux
           Version: 1.2beta1
          Platform: X86
        OS/Version: RHEL 4
            Status: NEW
          Severity: normal
          Priority: P1
         Component: IB Core
        AssignedTo: bugzilla at openib.org
        ReportedBy: Philippe.Gregoire at cea.fr


Creating and then deleting a subinterface whose pkey belongs to a partition
that does not include the node causes a kernel panic:
Mar 22 18:19:07 cors118 kernel: Unable to handle kernel NULL pointer
dereference at virtual address 00000054
Mar 22 18:19:07 cors118 kernel:  printing eip:
Mar 22 18:19:07 cors118 kernel: c02d3325
Mar 22 18:19:07 cors118 kernel: *pde = 2cbd1001
Mar 22 18:19:07 cors118 kernel: Oops: 0000 [#1]
Mar 22 18:19:07 cors118 kernel: SMP
Mar 22 18:19:07 cors118 kernel: Modules linked in: parport_pc lp parport
autofs4 i2c_dev i2c_core nfs lockd nfs_acl sunrpc rdma_ucm(U) ib_sdp(U)
rdma_cm(U) iw_cm(U) ib_addr(U) ib_local_sa(U) ib_ipoib(U) md5 ipv6 ide_dump
cciss_dump scsi_dump diskdump zlib_deflate dm_mirror dm_mod button battery ac
ohci_hcd ib_mthca(U) ib_umad(U) ib_ucm(U) ib_uverbs(U) ib_cm(U) ib_sa(U)
ib_mad(U) ib_core(U) tg3 floppy ext3 jbd lpfc scsi_transport_fc cciss sd_mod
scsi_mod
Mar 22 18:19:07 cors118 kernel: CPU:    3
Mar 22 18:19:07 cors118 kernel: EIP:    0060:[<c02d3325>]    Not tainted VLI
Mar 22 18:19:07 cors118 kernel: EFLAGS: 00010046   (2.6.9-42.ELsmp)
Mar 22 18:19:07 cors118 kernel: EIP is at _spin_lock_irqsave+0x7/0x45
Mar 22 18:19:07 cors118 kernel: eax: 00000050   ebx: 00000246   ecx: e99fd200  
edx: 00000000
Mar 22 18:19:07 cors118 kernel: esi: 00000050   edi: 00000050   ebp: 00000000  
esp: f368ce18
Mar 22 18:19:07 cors118 kernel: ds: 007b   es: 007b   ss: 0068
Mar 22 18:19:07 cors118 kernel: Process bash (pid: 4452, threadinfo=f368c000
task=f10f9830)
Mar 22 18:19:07 cors118 kernel: Stack: 00000000 e99fd240 f88edb33 e99fd000
e99fd240 e99fd240 00008020 f89dfefe
Mar 22 18:19:07 cors118 kernel:        e99fd000 e99fd240 e99fd240 f89dc176
00000001 e99fd000 f736aaf0 ea1d5000
Mar 22 18:19:07 cors118 kernel:        c016691a 00000001 00000001 f89e8338
f89e8338 00000058 f4be0000 f368ce98
Mar 22 18:19:07 cors118 kernel: Call Trace:
Mar 22 18:19:07 cors118 kernel:  [<f88edb33>] cm_destroy_id+0x12/0x1a5 [ib_cm]
Mar 22 18:19:07 cors118 kernel:  [<f89dfefe>] ipoib_cm_dev_stop+0x23/0xae
[ib_ipoib]
Mar 22 18:19:07 cors118 kernel:  [<f89dc176>] ipoib_ib_dev_stop+0x28/0x33c
[ib_ipoib]
Mar 22 18:19:07 cors118 kernel:  [<c016691a>] __link_path_walk+0x133/0xbb5
Mar 22 18:19:07 cors118 kernel:  [<c0130e1b>] flush_cpu_workqueue+0x14b/0x153
Mar 22 18:19:07 cors118 kernel:  [<c01204f5>] autoremove_wake_function+0x0/0x2d
Mar 22 18:19:07 cors118 kernel:  [<c01204f5>] autoremove_wake_function+0x0/0x2d
Mar 22 18:19:07 cors118 kernel:  [<c01204f5>] autoremove_wake_function+0x0/0x2d
Mar 22 18:19:07 cors118 kernel:  [<f89d9637>] ipoib_flush_paths+0x11a/0x122
[ib_ipoib]
Mar 22 18:19:07 cors118 kernel:  [<f89d916d>] ipoib_stop+0x58/0xf8 [ib_ipoib]
Mar 22 18:19:07 cors118 kernel:  [<c0280043>] dev_close+0x57/0x77
Mar 22 18:19:07 cors118 kernel:  [<c02823fc>] unregister_netdevice+0x94/0x1fa
Mar 22 18:19:07 cors118 kernel:  [<c022c7ea>] unregister_netdev+0xf/0x15
Mar 22 18:19:07 cors118 kernel:  [<f89debd1>] ipoib_vlan_delete+0x30/0xfa
[ib_ipoib]
Mar 22 18:19:07 cors118 kernel:  [<f89daf3c>] delete_child+0x39/0x46 [ib_ipoib]
Mar 22 18:19:07 cors118 kernel:  [<f89daf03>] delete_child+0x0/0x46 [ib_ipoib]
Mar 22 18:19:07 cors118 kernel:  [<c02202df>] class_device_attr_store+0x19/0x21
Mar 22 18:19:07 cors118 kernel:  [<c018e1a7>] flush_write_buffer+0x20/0x25
Mar 22 18:19:07 cors118 kernel:  [<c018e203>] sysfs_write_file+0x57/0x7c
Mar 22 18:19:07 cors118 kernel:  [<c015b0bc>] vfs_write+0xb6/0xe2
Mar 22 18:19:07 cors118 kernel:  [<c015b186>] sys_write+0x3c/0x62
Mar 22 18:19:07 cors118 kernel:  [<c02d4703>] syscall_call+0x7/0xb
Mar 22 18:19:07 cors118 kernel: Code: 6c 00 0c 60 2e c0 0f b6 02 84 c0 7e 08 0f
0b 6d 00 0c 60 2e c0 86 0a c3 f0 81 00 00 00 00 01 c3 f0 ff 00 c3 56 89 c6 53
9c 5b fa <81> 78 04 ad 4e ad de 74 18 ff 74 24 08 68 a1 6f 2e c0 e8 62 f5
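
For readers decoding the oops, the arithmetic behind the fault address can be
sketched as follows. This is an interpretation added for illustration, not part
of the original report. On i386 the bytes marked at the faulting instruction,
<81> 78 04 ad 4e ad de, decode to "cmpl $0xdead4ead, 0x4(%eax)", and the
register dump shows eax = 00000050. The immediate looks like the spinlock debug
magic of that kernel generation, which would be consistent with
_spin_lock_irqsave being handed a lock embedded at offset 0x50 of a NULL
structure pointer; treat that last reading as an assumption.

```python
import struct

# Bytes copied from the "Code:" line, starting at the faulting opcode (<81>).
faulting = bytes.fromhex("81 78 04 ad 4e ad de")

opcode, modrm, disp8 = faulting[0], faulting[1], faulting[2]
imm32 = struct.unpack_from("<I", faulting, 3)[0]  # little-endian immediate

mod = modrm >> 6             # 1: memory operand is register + 8-bit displacement
reg = (modrm >> 3) & 0b111   # 7: selects CMP within opcode group 0x81
rm = modrm & 0b111           # 0: base register is eax

eax = 0x00000050             # value from the register dump in the oops
fault_address = eax + disp8  # address the cmpl instruction tried to read

print(hex(imm32), hex(fault_address))  # prints: 0xdead4ead 0x54
```

The computed address matches the "virtual address 00000054" reported at the
top of the oops.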

SYSTEM INFORMATION:
[root at cors118 ~]# uname -a
Linux cors118 2.6.9-42.ELsmp #1 SMP Wed Jul 12 23:27:17 EDT 2006 i686 i686 i386
GNU/Linux
[root at cors118 ~]# cat /etc/redhat-release
Red Hat Enterprise Linux WS release 4 (Nahant Update 4)
[root at cors118 ~]# rpm -qa kernel-ib
kernel-ib-1.2-2.6.9_42.ELsmp
[root at cors118 ~]# lspci | grep Mella
03:01.0 PCI bridge: Mellanox Technologies MT23108 PCI Bridge (rev a1)
04:00.0 InfiniBand: Mellanox Technologies MT23108 InfiniHost (rev a1)

PARTITIONS CONFIGURED BY THE CISCO SUBNET MANAGER:
[root at cors118 ib0]# grep -v 0x0000 /sys/class/infiniband/mthca0/ports/1/pkeys/*
/sys/class/infiniband/mthca0/ports/1/pkeys/0:0xffff
/sys/class/infiniband/mthca0/ports/1/pkeys/1:0x8001
/sys/class/infiniband/mthca0/ports/1/pkeys/2:0x8002
/sys/class/infiniband/mthca0/ports/1/pkeys/3:0x8010
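
The table above contains 0xffff, 0x8001, 0x8002 and 0x8010, so a pkey such as
0x8020 is not one the node is a member of. A minimal sketch of how that check
could be scripted before writing to create_child is shown below. The function
names port_pkeys and pkey_in_table are hypothetical, and the code assumes only
the sysfs layout visible in the grep output (one file per table index, each
holding a hex value).

```python
from pathlib import Path


def port_pkeys(pkey_dir):
    """Return the set of non-zero pkey values found in a sysfs pkeys directory."""
    pkeys = set()
    for entry in Path(pkey_dir).iterdir():
        value = int(entry.read_text().strip(), 16)  # files hold e.g. "0x8001"
        if value != 0:  # 0x0000 marks an unused table slot
            pkeys.add(value)
    return pkeys


def pkey_in_table(pkey, pkey_dir):
    """True if the exact pkey value appears in the port's pkey table."""
    return pkey in port_pkeys(pkey_dir)
```

With the table shown in this report, a call such as
pkey_in_table(0x8020, "/sys/class/infiniband/mthca0/ports/1/pkeys") would
return False, while the same call with 0x8001 would return True.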

PROCEDURE
[root at cors118 ib0]# dmesg | grep ib0
divert: not allocating divert_blk for non-ethernet device ib0
[root at cors118 ib0]# pwd
/sys/class/net/ib0
[root at cors118 ib0]# echo 0x8001 > create_child
[root at cors118 ib0]# ls ..
eth0  eth1  ib0  ib0.8001  ib1  lo  sit0
[root at cors118 ib0]# dmesg | grep ib0
divert: not allocating divert_blk for non-ethernet device ib0
divert: not allocating divert_blk for non-ethernet device ib0.8001
[root at cors118 ib0]# echo 0x8001 > delete_child
[root at cors118 ib0]# dmesg | grep ib0
divert: not allocating divert_blk for non-ethernet device ib0
divert: not allocating divert_blk for non-ethernet device ib0.8001
divert: no divert_blk to free, ib0.8001 not ethernet
[root at cors118 ib0]# echo 0x8020 > create_child
[root at cors118 ib0]# dmesg | grep ib0
divert: not allocating divert_blk for non-ethernet device ib0
divert: not allocating divert_blk for non-ethernet device ib0.8001
divert: no divert_blk to free, ib0.8001 not ethernet
divert: not allocating divert_blk for non-ethernet device ib0.8020
[root at cors118 ib0]# ls ..
eth0  eth1  ib0  ib0.8020  ib1  lo  sit0
[root at cors118 ib0]# echo 0x8020 > delete_child
... and the kernel panics with the oops shown above.

