]> granicus.if.org Git - zfs/commit
Fix dnode allocation race
authorBrian Behlendorf <behlendorf1@llnl.gov>
Tue, 8 Aug 2017 15:38:53 +0000 (08:38 -0700)
committerGitHub <noreply@github.com>
Tue, 8 Aug 2017 15:38:53 +0000 (08:38 -0700)
commit9631681b75336ec6265d8fa5cecb353687c1f373
treebbfcadbf6ca2a583d27629cf3428401e9f6a85c6
parentd19a6d5c80fb24451a7d76716eaf38d3a3f933c7
Fix dnode allocation race

When performing concurrent object allocations using the new
multi-threaded allocator and large dnodes it's possible to
allocate overlapping large dnodes.

This case should have been handled by detecting an error
returned by dnode_hold_impl().  But that logic only checked
the returned dnp was not-NULL, and the dnp variable was not
reset to NULL when retrying.  Resolve this issue by properly
checking the return value of dnode_hold_impl().

Additionally, it was possible that dnode_hold_impl() would
misreport a dnode as free when it was in fact in use.  This
could occurs for two reasons:

* The per-slot zrl_lock must be held over the entire critical
  section which includes the alloc/free until the new dnode
  is assigned to children_dnodes.  Additionally, all of the
  zrl_lock's in the range must be held to protect moving
  dnodes.

* The dn->dn_ot_type cannot be solely relied upon to check
  the type.  When allocating a new dnode its type will be
  DMU_OT_NONE after dnode_create().  Only latter when
  dnode_allocate() is called will it transition to the new
  type.  This means there's a window when allocating where
  it can mistaken for a free dnode.

Reviewed-by: Giuseppe Di Natale <dinatale2@llnl.gov>
Reviewed-by: Ned Bass <bass6@llnl.gov>
Reviewed-by: Tony Hutter <hutter2@llnl.gov>
Reviewed-by: Olaf Faaland <faaland1@llnl.gov>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #6414
Closes #6439
module/zfs/dmu_object.c
module/zfs/dmu_tx.c
module/zfs/dnode.c
module/zfs/zap_micro.c
module/zfs/zfs_znode.c
tests/runfiles/linux.run
tests/zfs-tests/tests/functional/features/large_dnode/Makefile.am
tests/zfs-tests/tests/functional/features/large_dnode/large_dnode_008_pos.ksh [new file with mode: 0755]