starfive-tech/linux.git - StarFive Tech Linux Kernel for VisionFive (JH7110) boards (mirror)

Age	Commit message (Collapse)	Author	Files	Lines
2008-09-25	Btrfs: Use async helpers to deal with pages that have been improperly dirtied	Chris Mason	1	-0/+4
	Higher layers sometimes call set_page_dirty without asking the filesystem to help. This causes many problems for the data=ordered and cow code. This commit detects pages that haven't been properly setup for IO and kicks off an async helper to deal with them. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: New data=ordered implementation	Chris Mason	1	-1/+12
	The old data=ordered code would force commit to wait until all the data extents from the transaction were fully on disk. This introduced large latencies into the commit and stalled new writers in the transaction for a long time. The new code changes the way data allocations and extents work: * When delayed allocation is filled, data extents are reserved, and the extent bit EXTENT_ORDERED is set on the entire range of the extent. A struct btrfs_ordered_extent is allocated an inserted into a per-inode rbtree to track the pending extents. * As each page is written EXTENT_ORDERED is cleared on the bytes corresponding to that page. * When all of the bytes corresponding to a single struct btrfs_ordered_extent are written, The previously reserved extent is inserted into the FS btree and into the extent allocation trees. The checksums for the file data are also updated. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Drop some verbose printks	Chris Mason	1	-2/+0
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Add locking around volume management (device add/remove/balance)	Chris Mason	1	-0/+1
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Online btree defragmentation fixes	Chris Mason	1	-58/+3
	The btree defragger wasn't making forward progress because the new key wasn't being saved by the btrfs_search_forward function. This also disables the automatic btree defrag, it wasn't scaling well to huge filesystems. The auto-defrag needs to be done differently. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Replace the transaction work queue with kthreads	Chris Mason	1	-9/+107
	This creates one kthread for commits and one kthread for deleting old snapshots. All the work queues are removed. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Add btrfs_end_transaction_throttle to force writers to wait for pending commits	Chris Mason	1	-18/+0
	The existing throttle mechanism was often not sufficient to prevent new writers from coming in and making a given transaction run forever. This adds an explicit wait at the end of most operations so they will allow the current transaction to close. There is no wait inside file_write, inode updates, or cow filling, all which have different deadlock possibilities. This is a temporary measure until better asynchronous commit support is added. This code leads to stalls as it waits for data=ordered writeback, and it really needs to be fixed. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Fix snapshot deletion to release the alloc_mutex much more often.	Chris Mason	1	-0/+2
	This lowers the impact of snapshot deletion on the rest of the FS. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Drop locks in btrfs_search_slot when reading a tree block.	Chris Mason	1	-0/+1
	One lock per btree block can make for significant congestion if everyone has to wait for IO at the high levels of the btree. This drops locks held by a path when doing reads during a tree search. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Replace the big fs_mutex with a collection of other locks	Chris Mason	1	-8/+7
	Extent alloctions are still protected by a large alloc_mutex. Objectid allocations are covered by a objectid mutex Other btree operations are protected by a lock on individual btree nodes Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Start btree concurrency work.	Chris Mason	1	-1/+12
	The allocation trees and the chunk trees are serialized via their own dedicated mutexes. This means allocation location is still not very fine grained. The main FS btree is protected by locks on each block in the btree. Locks are taken top / down, and as processing finishes on a given level of the tree, the lock is released after locking the lower level. The end result of a search is now a path where only the lowest level is locked. Releasing or freeing the path drops any locks held. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Add a thread pool just for submit_bio	Chris Mason	1	-0/+4
	If a bio submission is after a lock holder waiting for the bio on the work queue, it is possible to deadlock. Move the bios into their own pool. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Add a mount option to control worker thread pool size	Chris Mason	1	-15/+15
	mount -o thread_pool_size changes the default, which is min(num_cpus + 2, 8). Larger thread pools would make more sense on very large disk arrays. This mount option controls the max size of each thread pool. There are multiple thread pools, so the total worker count will be larger than the mount option. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Add async worker threads for pre and post IO checksumming	Chris Mason	1	-118/+82
	Btrfs has been using workqueues to spread the checksumming load across other CPUs in the system. But, workqueues only schedule work on the same CPU that queued the work, giving them a limited benefit for systems with higher CPU counts. This code adds a generic facility to schedule work with pools of kthreads, and changes the bio submission code to queue bios up. The queueing is important to make sure large numbers of procs on the system don't turn streaming workloads into random workloads by sending IO down concurrently. The end result of all of this is much higher performance (and CPU usage) when doing checksumming on large machines. Two worker pools are created, one for writes and one for endio processing. The two could deadlock if we tried to service both from a single pool. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	btrfs: sanity mount option parsing and early mount code	Christoph Hellwig	1	-1/+4
	Also adds lots of comments to describe what's going on here. Signed-off-by: Christoph Hellwig <hch@lst.de> Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: bdi_init and bdi_destroy come with 2.6.23	Jan Engelhardt	1	-3/+3
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Always use the async submission queue for checksummed writes	Chris Mason	1	-7/+0
	This avoids IO stalls and poorly ordered IO from inline writers mixing in with the async submission queue Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Enable btree balancing on old kernels again	Chris Mason	1	-3/+0
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Change the congestion functions to meter the number of async submits ↵	Chris Mason	1	-0/+9
	as well The async submit workqueue was absorbing too many requests, leading to long stalls where the async submitters were stalling. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Fix btrfs_open_devices to deal with changes since the scan ioctls	Chris Mason	1	-2/+2
	Devices can change after the scan ioctls are done, and btrfs_open_devices needs to be able to verify them as they are opened and used by the FS. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Add mount -o degraded to allow mounts to continue with missing devices	Chris Mason	1	-20/+29
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Handle write errors on raid1 and raid10	Chris Mason	1	-5/+51
	When duplicate copies exist, writes are allowed to fail to one of those copies. This changeset includes a few changes that allow the FS to continue even when some IOs fail. It also adds verification of the parent generation number for btree blocks. This generation is stored in the pointer to a block, and it ensures that missed writes to are detected. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Pass down the expected generation number when reading tree blocks	Chris Mason	1	-17/+13
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Don't do btree balance_dirty_pages on old kernels, it stalls forever	Chris Mason	1	-0/+8
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Add support for online device removal	Chris Mason	1	-55/+46
	This required a few structural changes to the code that manages bdev pointers: The VFS super block now gets an anon-bdev instead of a pointer to the lowest bdev. This allows us to avoid swapping the super block bdev pointer around at run time. The code to read in the super block no longer goes through the extent buffer interface. Things got ugly keeping the mapping constant. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Fixes for 2.6.18 enterprise kernels	Chris Mason	1	-5/+19
	2.6.18 seems to get caught in an infinite loop when cancel_rearming_delayed_workqueue is called more than once, so this switches to cancel_delayed_work, which is arguably more correct. Also, balance_dirty_pages can run into problems with 2.6.18 based kernels because it doesn't have the per-bdi dirty limits. This avoids calling balance_dirty_pages on the btree inode unless there is actually something to balance, which is a good optimization in general. Finally there's a compile fix for ordered-data.h Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Deal with failed writes in mirrored configurations	Chris Mason	1	-2/+15
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Drop some verbose printks	Chris Mason	1	-13/+5
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Make the resizer work based on shrinking and growing devices	Chris Mason	1	-0/+4
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Add failure handling for read_sys_array	Chris Mason	1	-2/+9
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Fix the unplug_io_fn to grab a consistent copy of page->mapping	Chris Mason	1	-1/+12
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Deal with page == NULL in the btrfs_unplug_io_fn	Chris Mason	1	-2/+30
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Make an unplug function that doesn't unplug every spindle	Chris Mason	1	-11/+15
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Remove debugging statements from the invalidatepage calls	Chris Mason	1	-1/+1
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Scale the bdi ra_pages by the number of devices in the FS	Chris Mason	1	-1/+3
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Force page->private removal in btrfs_invalidatepage	Chris Mason	1	-0/+12
	btrfs_invalidatepage is not allowed to leave pages around on the lru. Any such pages will trigger an oops later on because the VM will see page->private and assume it is a buffer head. This also forces extra flushes of the async work queues before dropping all the pages on the btree inode during unmount. Left over items on the work queues are one possible cause of busy state ranges during truncate_inode_pages. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Set the btree inode i_size to OFFSET_MAX	Chris Mason	1	-7/+26
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Don't drop extent_map cache during releasepage on the btree inode	Chris Mason	1	-9/+14
	The btree inode should only have a single extent_map in the cache, it doesn't make sense to ever drop it. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Only do async bio submission for pdflush	Chris Mason	1	-0/+7
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Create a work queue for bio writes	Chris Mason	1	-3/+90
	This allows checksumming to happen in parallel among many cpus, and keeps us from bogging down pdflush with the checksumming code. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Add chunk uuids and update multi-device back references	Chris Mason	1	-1/+5
	Block headers now store the chunk tree uuid Chunk items records the device uuid for each stripes Device extent items record better back refs to the chunk tree Block groups record better back refs to the chunk tree The chunk tree format has also changed. The objectid of BTRFS_CHUNK_ITEM_KEY used to be the logical offset of the chunk. Now it is a chunk tree id, with the logical offset being stored in the offset field of the key. This allows a single chunk tree to record multiple logical address spaces, upping the number of bytes indexed by a chunk tree from 2^64 to 2^128. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: A few updates for 2.6.18 and versions older than 2.6.25	Chris Mason	1	-2/+10
	This includes fixing a missing spinlock init call that caused oops on mount for most kernels other than 2.6.25. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: bio_endio support for linux 2.6.23 and older.	Miguel	1	-1/+4
	bio_endio() changed prototype on linux 2.6.24, support older kernels using the older prototype. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Endianess bug fix for v0.13 with kernels	Miguel	1	-2/+2
	Fix for a endianess BUG when using btrfs v0.13 with kernels older than 2.6.23 Problem: Has of v0.13, btrfs-progs is using crc32c.c equivalent to the one found on linux-2.6.23/lib/libcrc32c.c Since crc32c_le() changed in linux-2.6.23, when running btrfs v0.13 with older kernels we have a missmatch between the versions of crc32c_le() from btrfs-progs and libcrc32c in the kernel. This missmatch causes a bug when using btrfs on big endian machines. Solution: btrfs_crc32c() macro that when compiling for kernels older than 2.6.23, does endianess conversion to parameters and return value of crc32c(). This endianess conversion nullifies the differences in implementation of crc32c_le(). If kernel 2.6.23 or better, it calls crc32c(). Signed-off-by: Miguel Sousa Filipe <miguel.filipe@gmail.com> --- Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Add extra checks to avoid removing extent_state from pages we can't free	Chris Mason	1	-0/+6
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Write out all super blocks on commit, and bring back proper barrier ↵	Chris Mason	1	-5/+113
	support Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Retry metadata reads in the face of checksum failures	Chris Mason	1	-19/+53
	Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Handle data block end_io through the async work queue	Chris Mason	1	-11/+23
	Before it was done by the bio end_io routine, the work queue code is able to scale much better with faster IO subsystems. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Do metadata checksums for reads via a workqueue	Chris Mason	1	-34/+224
	Before, metadata checksumming was done by the callers of read_tree_block, which would set EXTENT_CSUM bits in the extent tree to show that a given range of pages was already checksummed and didn't need to be verified again. But, those bits could go away via try_to_releasepage, and the end result was bogus checksum failures on pages that never left the cache. The new code validates checksums when the page is read. It is a little tricky because metadata blocks can span pages and a single read may end up going via multiple bios. Signed-off-by: Chris Mason <chris.mason@oracle.com>
2008-09-25	Btrfs: Add additional debugging for metadata checksum failures	Chris Mason	1	-1/+2
	Signed-off-by: Chris Mason <chris.mason@oracle.com>