[openib-general] page allocation failure with kdapl/mthca
Hal Rosenstock
halr at voltaire.com
Tue Jun 21 08:13:07 PDT 2005
Hi Roland,
With the latest mthca changes, I get the following when running kdapl in
"loopback" (this test worked prior to these changes):
Jun 21 08:56:09 localhost kernel: DT_Mdep_Thread_: page allocation failure. order:0, mode:0x20
Jun 21 08:56:09 localhost kernel: [<c01470f2>] __alloc_pages+0x2b2/0x440
Jun 21 08:56:09 localhost kernel: [<c014d77e>] cache_alloc_debugcheck_after+0x6e/0x1a0
Jun 21 08:56:09 localhost kernel: [<c014729f>] __get_free_pages+0x1f/0x40
Jun 21 08:56:09 localhost kernel: [<c010a8ae>] dma_alloc_coherent+0xce/0x100
Jun 21 08:56:09 localhost kernel: [<c014ddfd>] __kmalloc+0x9d/0xe0
Jun 21 08:56:10 localhost kernel: [<d08dce3e>] mthca_alloc_wqe_buf+0x44e/0x490 [ib_mthca]
Jun 21 08:56:10 localhost kernel: [<c0146bb2>] buffered_rmqueue+0xd2/0x2b0
Jun 21 08:56:10 localhost kernel: [<d08dd291>] mthca_alloc_qp_common+0x71/0x1b0 [ib_mthca]
Jun 21 08:56:10 localhost kernel: [<d08dd634>] mthca_alloc_qp+0xb4/0x270 [ib_mthca]
Jun 21 08:56:10 localhost kernel: [<d08e2c16>] mthca_create_qp+0x186/0x2f0 [ib_mthca]
Jun 21 08:56:10 localhost kernel: [<d08bebdc>] ib_create_qp+0x2c/0x90 [ib_core]
Jun 21 08:56:10 localhost kernel: [<d09c2156>] dapl_ib_qp_alloc+0x156/0x377 [ib_dat_provider]
Jun 21 08:56:10 localhost kernel: [<c014ddfd>] __kmalloc+0x9d/0xe0
Jun 21 08:56:10 localhost kernel: [<d09cf531>] dapl_evd_qp_async_error_callback+0x0/0xf5 [ib_dat_provider]
Jun 21 08:56:10 localhost kernel: [<d09ca22e>] dapl_ep_create+0x37b/0x42a [ib_dat_provider]
Jun 21 08:56:10 localhost kernel: [<d0a25e7c>] DT_Transaction_Main+0x90c/0x21a0 [kdapltest]
Jun 21 08:56:10 localhost kernel: [<c011180d>] __change_page_attr+0x2d/0x170
Jun 21 08:56:10 localhost kernel: [<c0111ce8>] kernel_map_pages+0x28/0x70
Jun 21 08:56:10 localhost kernel: [<c014d196>] cache_free_debugcheck+0x196/0x2d0
Jun 21 08:56:10 localhost kernel: [<d0a3395f>] DT_Mdep_Thread_Start_Routine+0x1f/0x30 [kdapltest]
Jun 21 08:56:10 localhost kernel: [<d0a33940>] DT_Mdep_Thread_Start_Routine+0x0/0x30 [kdapltest]
Jun 21 08:56:10 localhost kernel: [<c0100875>] kernel_thread_helper+0x5/0x10
Jun 21 08:56:10 localhost kernel: ------------[ cut here ]------------
Jun 21 08:56:10 localhost kernel: kernel BUG at mm/page_alloc.c:915!
Jun 21 08:56:10 localhost kernel: invalid operand: 0000 [#1]
Jun 21 08:56:10 localhost kernel: DEBUG_PAGEALLOC
Jun 21 08:56:10 localhost kernel: Modules linked in: kdapltest ib_dat_provider ib_cm ib_at dat ib_ipoib ib_sa ide_cd cdrom lp ipv6 autofs parport_pc parport uhci_hcd ehci_hcd ib_mthca ib_mad ib_core ohci_hcd eepro100 mii evdev usbcore
Jun 21 08:56:10 localhost kernel: CPU: 0
Jun 21 08:56:10 localhost kernel: EIP: 0060:[<c01473c4>] Not tainted VLI
Jun 21 08:56:10 localhost kernel: EFLAGS: 00010206 (2.6.11.6)
Jun 21 08:56:10 localhost kernel: EIP is at free_pages+0x24/0x40
Jun 21 08:56:10 localhost kernel: eax: 0009a5a5 ebx: 00000000 ecx: c0f4adf8 edx: 00000000
Jun 21 08:56:10 localhost kernel: esi: cdb1f7f8 edi: 00000000 ebp: 0000001d esp: c6f23c68
Jun 21 08:56:10 localhost kernel: ds: 007b es: 007b ss: 0068
Jun 21 08:56:10 localhost kernel: Process DT_Mdep_Thread_ (pid: 7983, threadinfo=c6f22000 task=c607da90)
Jun 21 08:56:10 localhost kernel: Stack: d08dcd3d cf907c3c 00001000 5a5a5a5a 00000000 c0146bb2 00000000 c101e940
Jun 21 08:56:10 localhost kernel: 00000000 00000001 00000282 c101e940 c04bddf8 fffffff4 c04bddf8 0000000c
Jun 21 08:56:10 localhost kernel: 00000040 00b01000 c6f23dd4 c0f4adf8 cdb1f7f8 cdca37f0 d08dd291 cdb1f7f8
Jun 21 08:56:10 localhost kernel: Call Trace:
Jun 21 08:56:10 localhost kernel: [<d08dcd3d>] mthca_alloc_wqe_buf+0x34d/0x490 [ib_mthca]
Jun 21 08:56:10 localhost kernel: [<c0146bb2>] buffered_rmqueue+0xd2/0x2b0
Jun 21 08:56:10 localhost kernel: [<d08dd291>] mthca_alloc_qp_common+0x71/0x1b0 [ib_mthca]
Jun 21 08:56:10 localhost kernel: [<d08dd634>] mthca_alloc_qp+0xb4/0x270 [ib_mthca]
Jun 21 08:56:10 localhost kernel: [<d08e2c16>] mthca_create_qp+0x186/0x2f0 [ib_mthca]
Jun 21 08:56:10 localhost kernel: [<d08bebdc>] ib_create_qp+0x2c/0x90 [ib_core]
Jun 21 08:56:10 localhost kernel: [<d09c2156>] dapl_ib_qp_alloc+0x156/0x377 [ib_dat_provider]
Jun 21 08:56:10 localhost kernel: [<c014ddfd>] __kmalloc+0x9d/0xe0
Jun 21 08:56:10 localhost kernel: [<d09cf531>] dapl_evd_qp_async_error_callback+0x0/0xf5 [ib_dat_provider]
Jun 21 08:56:10 localhost kernel: [<d09ca22e>] dapl_ep_create+0x37b/0x42a [ib_dat_provider]
Jun 21 08:56:10 localhost kernel: [<d0a25e7c>] DT_Transaction_Main+0x90c/0x21a0 [kdapltest]
Jun 21 08:56:10 localhost kernel: [<c011180d>] __change_page_attr+0x2d/0x170
Jun 21 08:56:10 localhost kernel: [<c0111ce8>] kernel_map_pages+0x28/0x70
Jun 21 08:56:10 localhost kernel: [<c014d196>] cache_free_debugcheck+0x196/0x2d0
Jun 21 08:56:10 localhost kernel: [<d0a3395f>] DT_Mdep_Thread_Start_Routine+0x1f/0x30 [kdapltest]
Jun 21 08:56:10 localhost kernel: [<d0a33940>] DT_Mdep_Thread_Start_Routine+0x0/0x30 [kdapltest]
Jun 21 08:56:10 localhost kernel: [<c0100875>] kernel_thread_helper+0x5/0x10
Jun 21 08:56:10 localhost kernel: Code: 8d b4 26 00 00 00 00 85 c0 74 2c 05 00 00 00 40 c1 e8 0c 3b 05 40 aa 3f c0 73 10 8b 0d 50 aa 3f c0 c1 e0 05 01 c8 e9 7c ff ff ff <0f> 0b 93 03 67 6b 31 c0 eb e6 89 f6 c3 eb 0d 90 90 90 90 90 90
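A trace like the one above can be reduced to its symbol/module pairs with a short script, which makes it easier to see which module each frame belongs to. The following is a minimal sketch, assuming the syslog line format shown in this log; the names `FRAME_RE` and `parse_frames` are hypothetical, and the sample lines are copied from the trace above.

```python
import re

# Matches stack frame entries such as:
#   [<d08dcd3d>] mthca_alloc_wqe_buf+0x34d/0x490 [ib_mthca]
# The trailing "[module]" is absent for symbols built into the kernel image.
FRAME_RE = re.compile(
    r"\[<(?P<addr>[0-9a-f]+)>\]\s+"
    r"(?P<sym>[A-Za-z_][\w.]*)\+0x(?P<off>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?:\s+\[(?P<mod>\w+)\])?"
)

def parse_frames(lines):
    """Return (symbol, offset, module) per frame line; module is None for core kernel."""
    frames = []
    for line in lines:
        m = FRAME_RE.search(line)
        if m:
            frames.append((m.group("sym"), int(m.group("off"), 16), m.group("mod")))
    return frames

# Sample lines taken from the log above; the first one is not a frame entry
# and is skipped because it has no "[<address>]" prefix.
sample = [
    "Jun 21 08:56:10 localhost kernel: EIP is at free_pages+0x24/0x40",
    "Jun 21 08:56:10 localhost kernel: [<d08dcd3d>] mthca_alloc_wqe_buf+0x34d/0x490 [ib_mthca]",
    "Jun 21 08:56:10 localhost kernel: [<c0146bb2>] buffered_rmqueue+0xd2/0x2b0",
]

for sym, off, mod in parse_frames(sample):
    print(f"{sym}+{off:#x} [{mod or 'kernel'}]")
```

Run over the second call trace, this lists mthca_alloc_wqe_buf in ib_mthca as the first entry below the free_pages EIP. Not every entry in a trace of this kind is necessarily a real caller, so the list is a starting point for reading the trace rather than a verified call chain.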
Is this related to the recent changes in mthca? Does something in kdapl
need to change?
Thanks.
-- Hal