Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

PANIC: dbuf_dirty at dbuf.c:2206: when creating files #16652

Open
Finix1979 opened this issue Oct 15, 2024 · 0 comments
Open

PANIC: dbuf_dirty at dbuf.c:2206: when creating files #16652

Finix1979 opened this issue Oct 15, 2024 · 0 comments
Labels
Type: Defect Incorrect behavior (e.g. crash, hang)

Comments

@Finix1979
Copy link
Contributor

System information

Type Version/Name
Distribution Name kylin10
Distribution Version
Kernel Version 4.19.90-24.4.v2101.ky10.x86_64
Architecture x86_64
OpenZFS Version Tag 2.0.7

Describe the problem you're observing

Our clients have encountered kernel crash issues twice in the past three months. According to the dmesg information, the problem occurred during the process of creating files with ZFS. We are using the ZFS code version 2.0.7.

The code for version 2.0.7 has been around for many years, and I'm not sure if the community has already addressed the issue. If it has been resolved, please let me know the commit ID, and I will attempt to backport it for the client.

Reading symbols from /usr/lib/debug//usr/lib/modules/4.19.90-24.4.v2101.ky10.x86_64/extra/zfs/zfs/zfs.ko-2.0.21-1.ky10.x86_64.debug...
(gdb) list *(dbuf_dirty + 0x517)
0x1e9a7 is in dbuf_dirty (/usr/src/debug/zfs-kmod-2.0.21-1.ky10.x86_64/_kmod_build_4.19.90-24.4.v2101.ky10.x86_64/../zfs-2.0.7/module/zfs/dbuf.c:2206).
2201 parent_held = TRUE;
2202 }
2203 if (drop_struct_rwlock)
2204 rw_exit(&dn->dn_struct_rwlock);
2205 ASSERT3U(db->db_level + 1, ==, parent->db_level);
2206 di = dbuf_dirty(parent, tx);
2207 if (parent_held)
2208 dbuf_rele(parent, FTAG);
2209
2210 mutex_enter(&db->db_mtx);

Describe how to reproduce the problem

Currently, the information I have obtained from the client is limited to this. If I receive more information later, I will provide feedback to the community.

Include any warning/errors/backtraces from the system logs

[7257050.685254] BUG: unable to handle kernel paging request at ffffffffffffffe8
[7257050.685261] PGD 2b38a0d067 P4D 2b38a0d067 PUD 2b38a0f067 PMD 0
[7257050.685268] Oops: 0000 [#1] SMP NOPTI
[7257050.685273] CPU: 105 PID: 1482992 Comm: srepd Kdump: loaded Tainted: P OE 4.19.90-24.4.v2101.ky10.x86_64 #1
[7257050.685275] Hardware name: Suma R6440H0/62DB32, BIOS CXYH051021 11/21/2022
[7257050.685340] RIP: 0010:dbuf_dirty+0x657/0x950 [zfs]
[7257050.685344] Code: e8 4e 1b 8c f2 4c 89 ff c6 07 00 0f 1f 40 00 e9 b8 fd ff ff 48 8b 09 48 39 4c 24 18 74 23 49 89 cd 49 01 c5 0f 84 fe fa ff ff <49> 8b 55 10 48 39 d5 0f 83 e8 fa ff ff 48 8b 09 48 3b 4c 24 18 75
[7257050.685346] RSP: 0018:ffff98fa90be3838 EFLAGS: 00010286
[7257050.685348] RAX: ffffffffffffffd8 RBX: ffff8938b98a7350 RCX: 0000000000000000
[7257050.685350] RDX: ffff894f596ff800 RSI: ffff89697ec56000 RDI: ffff8938b98a7498
[7257050.685352] RBP: 0000000000515ccb R08: 0000000000000000 R09: ffff893187c0ec00
[7257050.685354] R10: ffff98fa90be3890 R11: ffffd4e935aaae08 R12: ffff89697ec56000
[7257050.685355] R13: ffffffffffffffd8 R14: ffff895e31613000 R15: ffff8938b98a7498
[7257050.685358] FS: 00007f8dd67fc700(0000) GS:ffff8960dfc40000(0000) knlGS:0000000000000000
[7257050.685359] CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[7257050.685361] CR2: ffffffffffffffe8 CR3: 00000032ae5a0000 CR4: 00000000003406e0
[7257050.685362] Call Trace:
[7257050.685403] dbuf_dirty+0x517/0x950 [zfs]
[7257050.685442] dnode_setdirty+0x8a/0xe0 [zfs]
[7257050.685484] dnode_allocate+0x108/0x170 [zfs]
[7257050.685516] dmu_object_alloc_impl+0x334/0x3d0 [zfs]
[7257050.685548] ? dbuf_rele_and_unlock+0x1b9/0x620 [zfs]
[7257050.685582] dmu_object_alloc_dnsize+0x1c/0x30 [zfs]
[7257050.685635] zfs_mknode+0x130/0xf40 [zfs]
[7257050.685680] ? dsl_dir_tempreserve_space+0x242/0x4e0 [zfs]
[7257050.685728] ? txg_rele_to_quiesce+0x26/0x40 [zfs]
[7257050.685762] ? dmu_tx_assign+0x1e3/0x450 [zfs]
[7257050.685812] zfs_create+0x620/0x930 [zfs]
[7257050.685862] zpl_create+0xae/0x180 [zfs]
[7257050.685871] path_openat+0x1233/0x15c0
[7257050.685876] do_filp_open+0x9b/0x110
[7257050.685881] ? __check_object_size+0xd4/0x1a0
[7257050.685885] ? strncpy_from_user+0x47/0x180
[7257050.685888] ? __check_object_size+0xd4/0x1a0
[7257050.685891] ? do_sys_open+0x1bd/0x250
[7257050.685893] do_sys_open+0x1bd/0x250
[7257050.685898] do_syscall_64+0x5b/0x1d0
[7257050.685903] entry_SYSCALL_64_after_hwframe+0x44/0xa9
[7257050.685907] RIP: 0033:0x7f8f0449b81e
[7257050.685909] Code: 89 54 24 08 e8 a3 f4 ff ff 8b 74 24 0c 48 8b 3c 24 41 89 c0 44 8b 54 24 08 b8 01 01 00 00 89 f2 48 89 fe bf 9c ff ff ff 0f 05 <48> 3d 00 f0 ff ff 77 30 44 89 c7 89 44 24 08 e8 ce f4 ff ff 8b 44
[7257050.685910] RSP: 002b:00007f8dd67fb560 EFLAGS: 00000293 ORIG_RAX: 0000000000000101
[7257050.685913] RAX: ffffffffffffffda RBX: 00007f8dd67fb870 RCX: 00007f8f0449b81e
[7257050.685914] RDX: 0000000000000042 RSI: 00007f8dc400eea0 RDI: 00000000ffffff9c
[7257050.685916] RBP: 00007f8dd67fb850 R08: 0000000000000000 R09: 00007f8dc400eeb0
[7257050.685917] R10: 0000000000000180 R11: 0000000000000293 R12: 0000000000000042
[7257050.685918] R13: 0000000000000180 R14: 0000000000000180 R15: 000000000000015c
[7257050.685921] Modules linked in: cdc_ether usbnet mii binfmt_misc tcp_diag udp_diag inet_diag unix_diag xfs libcrc32c loop i2dtrack(OE) target_core_mod sfs(OE) dattobd(OE) ip6table_filter ip6_tables iptable_filter bonding sunrpc vfat fat ipmi_ssif amd64_edac_mod edac_mce_amd kvm_amd ccp kvm irqbypass ses crct10dif_pclmul enclosure crc32_pclmul scsi_transport_sas ghash_clmulni_intel joydev pcspkr sg i2c_piix4 k10temp ipmi_si ipmi_devintf ipmi_msghandler zfs(POE) zunicode(POE) zzstd(OE) zlua(OE) zavl(POE) icp(POE) zcommon(POE) znvpair(POE) spl(OE) ip_tables ext4 mbcache jbd2 sd_mod lpfc ast mlx5_core drm_kms_helper syscopyarea sysfillrect nvmet_fc sysimgblt fb_sys_fops nvmet crc32c_intel ttm nvme_fc ahci mlxfw nvme_fabrics devlink igb libahci drm megaraid_sas nvme_core libata scsi_transport_fc dca i2c_algo_bit
[7257050.685976] dm_mirror dm_region_hash dm_log dm_mod
[7257050.685981] CR2: ffffffffffffffe8

@Finix1979 Finix1979 added the Type: Defect Incorrect behavior (e.g. crash, hang) label Oct 15, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Type: Defect Incorrect behavior (e.g. crash, hang)
Projects
None yet
Development

No branches or pull requests

1 participant