Andrew Morton [Fri, 12 Mar 2004 00:25:36 +0000 (16:25 -0800)]
[PATCH] vm: balance inactive zone refill rates
The current refill logic in refill_inactive_zone() takes an arbitrarily large
number of pages and chops it down to SWAP_CLUSTER_MAX*4, regardless of the
size of the zone.
This has the effect of reducing the amount of refilling of large zones
proportionately much more than of small zones.
We made this change in may 2003 and I'm damned if I remember why. let's put
it back so we don't truncate the refill count and see what happens.
Andrew Morton [Fri, 12 Mar 2004 00:25:12 +0000 (16:25 -0800)]
[PATCH] vmscan: batch up inactive list scanning work
From: Nick Piggin <piggin@cyberone.com.au>
Use a "refill_counter" for inactive list scanning, similar to the one used
for active list scanning. This batches up scanning now that we precisely
balance ratios, and don't round up the amount to be done.
No observed benefits, but I imagine it would lower the acquisition
frequency of the lru locks in some cases, and make codepaths more efficient
in general due to cache niceness.
Andrew Morton [Fri, 12 Mar 2004 00:25:03 +0000 (16:25 -0800)]
[PATCH] vmscan: less throttling of page allocators and kswapd
This is just a random unsubstantiated tuning tweak: don't immediately
throttle page allocators and kwapd when the going is getting heavier: scan a
bit more of the LRU before throttling.
Andrew Morton [Fri, 12 Mar 2004 00:24:52 +0000 (16:24 -0800)]
[PATCH] fix the kswapd zone scanning algorithm
This removes a vestige of the old algorithm. We don't want to skip zones if
all_zones_ok is true: we've already precalculated which zones need scanning
and this just stops us from ever performing kswapd reclaim from the DMA zone.
Andrew Morton [Fri, 12 Mar 2004 00:24:40 +0000 (16:24 -0800)]
[PATCH] kswapd: fix lumpy page reclaim
As kswapd is now scanning zones in the highmem->normal->dma direction it can
get into competition with the page allocator: kswapd keep on trying to free
pages from highmem, then kswapd moves onto lowmem. By the time kswapd has
done proportional scanning in lowmem, someone has come in and allocated a few
pages from highmem. So kswapd goes back and frees some highmem, then some
lowmem again. But nobody has allocated any lowmem yet. So we keep on and on
scanning lowmem in response to highmem page allocations.
Andrew Morton [Fri, 12 Mar 2004 00:24:29 +0000 (16:24 -0800)]
[PATCH] kswapd: avoid unnecessary reclaiming from higher zones
Currently kswapd walks across all zones in dma->normal->highmem order,
performing proportional scanning until all zones are OK. This means that
pressure against ZONE_NORMAL causes unnecessary reclaim of ZONE_HIGHMEM.
To fix that up we change kswapd so that it walks the zones in the
high->normal->dma direction, skipping zones which are OK. Once it encounters
a zone which needs some reclaim kswapd will perform proportional scanning
against that zone as well as all the succeeding lower zones.
We scan the lower zones even if they have sufficient free pages. This is
because
a) the lower zone may be above pages_high, but because of the incremental
min, the lower zone may still not be eligible for allocations. That's bad
because cache in that lower zone will then not be scanned at the correct
rate.
b) pages in this lower zone are usable for allocations against the higher
zone. So we do want to san all the relevant zones at an equal rate.
Andrew Morton [Fri, 12 Mar 2004 00:24:10 +0000 (16:24 -0800)]
[PATCH] Balance inter-zone scan rates
When page reclaim is working out how many pages to san in a zone (max-scan)
it presently rounds that number up if it looks too small - for work batching.
Problem is, this can result in excessive scanning against small zones which
have few inactive pages. So remove it.
Not that it is possible for max_scan to be zero. That's OK - it'll become
non-zero as the priority increases.
Andrew Morton [Fri, 12 Mar 2004 00:24:01 +0000 (16:24 -0800)]
[PATCH] vmscan: drive everything via nr_to_scan
Page reclaim is currently a bit schitzo: sometimes we say "go and scan this
many pages and tell me how many pages were freed" and at other times we say
"go and scan this many pages, but stop if you freed this many".
It makes the logic harder to control and to understand. This patch coverts
everything into the "go and scan this many pages and tell me how many pages
were freed" model.
It doesn't seem to affect performance much either way.
Andrew Morton [Fri, 12 Mar 2004 00:23:50 +0000 (16:23 -0800)]
[PATCH] vmscan: zone balancing fix
We currently have a problem with the balancing of reclaim between zones: much
more reclaim happens against highmem than against lowmem.
This patch partially fixes this by changing the direct reclaim path so it
does not bale out of the zone walk after having reclaimed sufficient pages
from highmem: go on to reclaim from lowmem regardless of how many pages we
reclaimed from lowmem.
Andrew Morton [Fri, 12 Mar 2004 00:23:38 +0000 (16:23 -0800)]
[PATCH] vm: scan slab in response to highmem scanning
The patch which went in six months or so back which said "only reclaim slab
if we're scanning lowmem pagecache" was wrong. I must have been asleep at
the time.
We do need to scan slab in response to highmem page reclaim as well. Because
all the math is based around the total amount of memory in the machine, and
we know that if we're performing highmem page reclaim then the lower zones
have no free memory.
Andrew Morton [Fri, 12 Mar 2004 00:22:50 +0000 (16:22 -0800)]
[PATCH] kswapd throttling fixes
The logic in balance_pgdat() is all bollixed up.
- the incoming arg `nr_pages' should be used to determine if we're being
asked to free a specific number of pages, not `to_free'.
- local variable `to_free' is not appropriate for the determination of
whether we failed to bring all zones to appropriate free pages levels.
Fix this by correctly calculating `all_zones_ok' and then use
all_zones_ok to determine whether we need to throttle kswapd.
So the logic now is:
for (increasing priority) {
all_zones_ok = 1;
for (all zones) {
to_reclaim = number of pages to try to reclaim
from this zone;
max_scan = number of pages to scan in this pass
(gets larger as `priority' decreases)
/*
* set `reclaimed' to the number of pages which were
* actually freed up
*/
reclaimed = scan(max_scan pages);
reclaimed += shrink_slab();
to_free -= reclaimed; /* for the `nr_pages>0' case */
/*
* If this scan failed to reclaim `to_reclaim' or more
* pages, we're getting into trouble. Need to scan
* some more, and throttle kswapd. Note that this
* zone may now have sufficient free pages due to
* freeing activity by some other process. That's
* OK - we'll pick that info up on the next pass
* through the loop.
*/
if (reclaimed < to_reclaim)
all_zones_ok = 0;
}
if (to_free > 0)
continue; /* swsusp: need to do more work */
if (all_zones_ok)
break; /* kswapd is done */
/*
* OK, kswapd is getting into trouble. Take a nap, then take
* another pass across the zones.
*/
blk_congestion_wait();
}
Andrew Morton [Fri, 12 Mar 2004 00:22:27 +0000 (16:22 -0800)]
[PATCH] Narrow blk_congestion_wait races
From: Nick Piggin <piggin@cyberone.com.au>
The addition of the smp_mb and the other change is to try to close the
window for races a bit. Obviously they can still happen, it's a racy
interface and it doesn't matter much.
Andrew Morton [Fri, 12 Mar 2004 00:21:24 +0000 (16:21 -0800)]
[PATCH] Add barriers to avoid race in mempool_alloc/free
From: Chris Mason <mason@suse.com>
mempool_alloc() and mempool_free() check pool->curr_nr without any locks
held. This can lead to skipping a wakeup when there are people waiting,
and sleeping when there are free elements in the pool.
I can't trigger this reliably, but sooner or later someone on ppc is
probably going to hit it.
Andrew Morton [Fri, 12 Mar 2004 00:20:22 +0000 (16:20 -0800)]
[PATCH] fbdev: monitor detection fixes
From: James Simmons <jsimmons@infradead.org>,
Kronos <kronos@kronoz.cjb.net>
Various fixes and enhancements to the monitor hardware detection code. The
only driver that uses it is the radeon driver.
Old EDID parsing code was very verbose, half of the patch address this (ie.
print lots of stuff iff DEBUG). The other big change is the FB_MODE_IS_*
stuff: we really need a way to know the origin of a video mode. In this way
we can select video mode that comes from EDID instead of VESA or GTF.
Drivers other than radeonfb won't be affected because they cannot (yet) get
EDID from the monitor and don't use EDID related code.
Andrew Morton [Fri, 12 Mar 2004 00:19:57 +0000 (16:19 -0800)]
[PATCH] fix raid0 readahead size
From: Arjan van de Ven <arjanv@redhat.com>
Readahead of raid0 was suboptimal; it read only 1 stride ahead. The
problem with this is that while it will keep all spindles busy, it will not
actually manage to make larger IO's, eg each disk would just do the chunk
size IO. Doing at least 2 chunks is more than appropriate so that each
spindle will get a chance to merge IO's.
Andrew Morton [Fri, 12 Mar 2004 00:19:45 +0000 (16:19 -0800)]
[PATCH] module.h __attribute_used__ fix
From: Rusty Russell <rusty@rustcorp.com.au>
Someone added __attribute_used__ throughout module.h, but didn't remove the
", unused". Looks like some arch/gcc combos still consider it unused, and
discard the fn.
The attached patch is needed to stop showing us "Macintosh device drivers"
for all architectures via menuconfig || xconfig || gconfig. It's only
necessary for PPC and/or MAC.
Andrew Morton [Fri, 12 Mar 2004 00:19:04 +0000 (16:19 -0800)]
[PATCH] add nowarn to a few pte chain allocators
From: Arjan van de Ven <arjanv@redhat.com>
Several of the pte_chain_alloc() allocators that use GFP_ATOMIC have a
fallback for failure that sleeps; they thus need to not warn on failure..
Seen during a big fork on a busy system.
Andrew Morton [Fri, 12 Mar 2004 00:18:43 +0000 (16:18 -0800)]
[PATCH] EDD: Get Legacy Parameters
From: Matt Domsch <Matt_Domsch@dell.com>
Patch below from Patrick J. LoPresti and myself. Patrick describes:
Why this patch? The problem is that the legacy BIOS interface
(INT13/AH=3D08) for querying the disk geometry returns different values
than the extended INT13 interface which the EDD code currently uses. This
is because the legacy interface only provides a 10-bit cylinder field, so
modern BIOSes "lie" about the head/sector counts in order to make more of
the disk visible within the first 1024 cylinders.
Many non-Linux applications, including the stock Windows boot loader, DOS
fdisk, etc., rely upon the legacy interface and geometry. So it is useful
to be able to obtain the legacy values from a running Linux kernel.
What this patch does is to add new entries under
/sys/firmware/edd/int13_devXX named "legacy_cylinders", "legacy_heads", and
"legacy_sectors". These provide the geometry given by the legacy
INT13/AH=3D08 BIOS interface, just like the current "default_cylinders"
etc. provide the the geometry given by the INT13/AH=3D48 interface.
Without this patch, I cannot use Linux to partition a drive and install
Windows, which happens to be my application.
- Pat
http://unattended.sourceforge.net/
In addition, this adds two buggy BIOS workarounds in the EDD int13
calls as suggested by Ralf Brown's interrupt list.
I'm also interested in moving this code out of arch/i386/kernel/edd.c and
include/asm-i386/edd.h, as I believe it is applicable on x86-64 as well.
However, there's no good place under drivers/ to put edd.c when it's not
tied to a bus, but to several CPU architectures and their firmwares...
Maybe a new directory drivers/firmware?
Andrew Morton [Fri, 12 Mar 2004 00:18:34 +0000 (16:18 -0800)]
[PATCH] wavfront.c needs syscalls.h
sound/oss/wavfront.c: In function `wavefront_download_firmware':
sound/oss/wavfront.c:2524: warning: implicit declaration of function `sys_open'
sound/oss/wavfront.c:2533: warning: implicit declaration of function `sys_read'
sound/oss/wavfront.c:2582: warning: implicit declaration of function `sys_close
Andrew Morton [Fri, 12 Mar 2004 00:18:23 +0000 (16:18 -0800)]
[PATCH] Fix reading the last block on a bdev
From: Chris Mason <mason@suse.com>
This patch fixes a problem we're hitting on ia64 with page sizes > 4k.
When the page size is greater than the block size, and parts of the page
fall past the end of the device, readpage will fail because
blkdev_get_block returns -EIO for blocks past i_size.
The attached patch changes blkdev_get_block to return holes when reading
past the end of the device, which allows us to read that last valid 4k
block and then fill the rest of the page with zeros. Writes will still
fail with -EIO.
Andrew Morton [Fri, 12 Mar 2004 00:17:41 +0000 (16:17 -0800)]
[PATCH] fadvise(POSIX_FADV_DONTNEED) fixups
From: WU Fengguang <wfg@mail.ustc.edu.cn>
- In sys_fadvise64_64(): if the start and/or end offsets do not fall on
page boundaries, preserve the partial pages. The thinking here is that it
is better to preserve needed memory than to not shoot down unneeded memory.
- In invalidate_mapping_pages(): we were invalidating an entire pagevec's
worth of pages each time around, even if that went beyond the part of the
file which the caller asked to be invalidated. Fix that up.
Andrew Morton [Fri, 12 Mar 2004 00:17:10 +0000 (16:17 -0800)]
[PATCH] Remove arbitrary #acl entries limits on ext[23] when reading
From: Andreas Gruenbacher <agruen@suse.de>
Remove the arbitrary limit of 32 ACL entries on ext[23] when reading from
disk. This change is backward compatible; we need to have this change in
to be able to also allow writing big ACLs.
The second patch that removes the ACL entry limit for writes is not
included. I don't want to push that patch now, because large ACLs would
cause 2.4 and current 2.6 kernels to fail. My plan is to remove the second
limit later, in a half-year or year or so.
Andrew Morton [Fri, 12 Mar 2004 00:16:50 +0000 (16:16 -0800)]
[PATCH] /proc data corruption check
From: Arjan van de Ven <arjanv@redhat.com>
If someone removes a /proc directory which still has subdirectories it will
lead to very nasty things (dentries remaining on hash chains etc etc etc).
The BUG_ON in the patch below will catch this nasty situation.
Andrew Morton [Fri, 12 Mar 2004 00:16:39 +0000 (16:16 -0800)]
[PATCH] Remove unneeded unlock in ipc/sem.c
From: Manfred Spraul <manfred@colorfullife.com>
sem_revalidate checks that a semaphore array didn't disappear while the
code was running without the semaphore array spinlock. If the array
disappeared, then it will return without holding a lock. find_undo calls
sem_revalidate and then sem_unlock, even if sem_revalidate failed. The
sem_unlock call must be removed.
Mingming Cao reported a spinlock deadlock with sysv semaphores. A
superflous unlock doesn't explain the deadlock, but it's obviously a bug.
Andrew Morton [Fri, 12 Mar 2004 00:16:17 +0000 (16:16 -0800)]
[PATCH] loop setup race fix
From: Chris Mason <mason@suse.com>
There's a race in loopback setup, it's easiest to trigger with one or more
procs doing loopback mounts at the same time. The problem is that
fs/block_dev.c:do_open() only calls bdev_set_size on the first open.
Picture two procs:
proc1: mount -o loop file1 mnt1
proc2: mount -o loop file2 mnt2
proc1 proc2
open /dev/loop0 # bd_openers now 1
do_open
bd_set_size(bdev, 0) # loop unbound, so bdev size is 0
open /dev/loop0 # bd_openers now 2
loop_set_fd # disk capacity now correct, but
# bdev not updated
mount /dev/loop0 /mnt
do_open
Because bd_openers != 0 for the last do_open, bd_set_size is not called
again and a size of 0 is used. This eventually leads to an oops when the
loop device is unmounted, because fsync_bdev calls block_write_full_page
who decides every page on the block device is outside i_size and unmaps
them.
When ext2 or reiserfs try to sync a metadata buffer, we get an oops on
because the buffers are no longer mapped.
The patch below changes loop_set_fd and loop_clr_fd to also manipulate the
size of the block device, which fixes things for me.
Andrew Morton [Fri, 12 Mar 2004 00:16:06 +0000 (16:16 -0800)]
[PATCH] LOOP_CHANGE_FD ioctl
From: Arjan van de Ven <arjanv@redhat.com>
The patch below (written by Al Viro) solves a nasty chicken-and-egg issue
for operating system installers (well at least anaconda but the problem
domain is not exclusive to that)
The basic problem is this:
- The small first stage installer locates the image file of the second
stage installer (which has X and all the graphical stuff); this image can
be on the same CD, but it can come via NFS, http or ftp or ... as well.
- The first stage installer loop-back mounts this image and gives control
to the second stage installer by calling some binary there.
- The graphical installer then asks the user all those questions and
starts installing packages. Again the packages can come from the CD but
also from NFS or http or ...
Now in case of a CD install, once all requested packages from the first CD
are installed, the installer wants to unmount and eject the CD and prompt
the user to put CD 2 in....... EXCEPT that the unmount can't work since
the installer is actually running from a loopback mount of this cd.
The solution is a "LOOP_CHANGE_FD" ioctl, where basically the installer
copies the image to the harddisk (which can only be done late since only
late the target harddisk is mkfs'd) and then magically switches the backing
store FD from underneath the loop device to the one on the target harddisk
(and thus unbusying the CD mount).
This is obviously only allowed if the size of the new image is identical
and if the loop image is read-only in the first place. It's the
responsibility of root to make sure the contents is the same (but that's of
the give-root-enough-rope kind)
Andrew Morton [Fri, 12 Mar 2004 00:15:54 +0000 (16:15 -0800)]
[PATCH] kbuild: Cause `make clean' to remove more files
From: Sam Ravnborg <sam@ravnborg.org>
Make the difference between 'make clean' and 'make distclean/mrproper' more
explicit.
make clean now removes all generated files except .config* and .version.
The result is much easier to understand now.
make clean deletes all generated files (except .config* and .version).
make mrproper deletes configuration and all temporary files left by patch,
editors and the like.
The only path to get to del_timer call in __generic_unplug_device() is when
blk_remove_plug() returns 1, and in that case it already removed the
unplug_timer.
Andrew Morton [Fri, 12 Mar 2004 00:15:08 +0000 (16:15 -0800)]
[PATCH] NUMA-aware zonelist builder
From: <j-nomura@ce.jp.nec.com>
The attached patch is NUMA-aware zonelist builder patch, which sorts
zonelist in the order that near-node first, far-node last. In lse-tech and
linux-ia64, where most of NUMA people resides, no objections are raised so
far.
The patch adds NUMA-specific version of build_zonelists which calls
find_next_best_node to select the next-nearest node to add to zonelist.
Andrew Morton [Fri, 12 Mar 2004 00:14:55 +0000 (16:14 -0800)]
[PATCH] kbuild: Remove CFLAGS assignment in i386/mach-*/Makefile
From: Sam Ravnborg <sam@ravnborg.org>
The EXTRA_CFLAGS assignments in the following files are a left-over from
the early 2.5 days where the source was not compiled from the root of the
source tree.
Removing these wrong assignments fixes
http://bugme.osdl.org/show_bug.cgi?id=2210
A script named 'kernel' in the .. directory no longer halt compilation.
Andrew Morton [Fri, 12 Mar 2004 00:14:02 +0000 (16:14 -0800)]
[PATCH] dm: default queue limits
From: Joe Thornber <thornber@redhat.com>
Fill in missing queue limitations when table is complete instead of enforcing
the "default" limits on every dm device. Problem noticed by Mike Christie.
Andrew Morton [Fri, 12 Mar 2004 00:13:40 +0000 (16:13 -0800)]
[PATCH] dm: endio method
From: Joe Thornber <thornber@redhat.com>
Add an endio method to targets. This method is allowed to request another
shot at failed ios (think multipath). Context can be passed between the map
method and the endio method.
Andrew Morton [Fri, 12 Mar 2004 00:13:16 +0000 (16:13 -0800)]
[PATCH] i386 very early memory detection cleanup patch
From: "H. Peter Anvin" <hpa@zytor.com>
This patch cleans up the very early memory setup on the i386 platform. In
particular, it removes the hard-coded 8 MB limit completely by dynamically
creating the early-boot pagetables rather than having them hard coded.
While I was at it, I changed head.S so that it always sets up a local GDT;
this means among other things that SMP and VISWS are no longer special
cases, and is conceptually cleaner to boot. The VISWS people have
confirmed it works on VISWS.
It also uses a separate entrypoint for non-boot processors since this is
completely kernel-internal anyway. This eliminates the need to set %bx on
boot. (If you think this is a bad idea I can eliminate this change; it
just seemed cleaner to me to do it this way.)
Additionally, zero bss with rep;stosl rather that rep;stosb.
Andrew Morton [Fri, 12 Mar 2004 00:12:38 +0000 (16:12 -0800)]
[PATCH] time interpolator fix
From: john stultz <johnstul@us.ibm.com>
In developing the ia64-cyclone patch, which implements a cyclone based time
interpolator, I found the following bug which could cause time
inconsistencies.
In update_wall_time_one_tick(), which is called each timer interrupt, we
call time_interpolator_update(delta_nsec) where delta_nsec is approximately
NSEC_PER_SEC/HZ. This directly correlates with the changes to xtime which
occurs in update_wall_time_one_tick().
However in update_wall_time(), on a second overflow, we again call
time_interpolator_update(NSEC_PER_SEC). However while the components of
xtime are being changed, the overall value of xtime does not (nsec is
decremented NSEC_PER_SEC and sec is incremented). Thus this call to
time_interpolator_update is incorrect.
This patch removes the incorrect call to time_interpolator_update and was
found to resolve the time inconsistencies I had seen while developing the
ia64-cyclone patch.
Andrew Morton [Fri, 12 Mar 2004 00:12:15 +0000 (16:12 -0800)]
[PATCH] read-only support for UFS2
From: Niraj Kumar <niraj17@iitbombay.org>
This patch adds read-only support for ufs2 (used in FreeBSD 5.x) variant of
ufs filesystem. For filesystem specific tools, see
http://ufs-linux.sourceforge.com .
Andrew Morton [Fri, 12 Mar 2004 00:12:05 +0000 (16:12 -0800)]
[PATCH] adaptive lazy readahead
From: Suparna Bhattacharya <suparna@in.ibm.com>
From: Ram Pai <linuxram@us.ibm.com>
Pipelined readahead behaviour is suitable for sequential reads, but not for
large random reads (typical of database workloads), where lazy readahead
provides a big performance boost.
One option (suggested by Andrew Morton) would be to have the application
pass hints to turn off readahead by setting the readahead window to zero
using posix_fadvise64(POSIX_FADV_RANDOM), and to special-case that in
do_generic_mapping_read to completely bypass the readahead logic and
instead read in all the pages needed directly.
This was the idea I started with. But then I thought, we can do a still
better job ? How about adapting the readahead algorithm to lazy-read or
non-lazy-read based on the past i/o patterns ?
The overall idea is to keep track of average number of contiguous pages
accessed in a file. If the average at any given time is above ra->pages
the pattern is sequential. If not the pattern is random. If pattern is
sequential do non-lazy-readahead( read as soon as the first page in the
active window is touched) else do lazy-readahead.
I have studied the behaviour of this patch using my user-level simulator.
It adapts pretty well.
Note from Suparna: This appears to bring streaming AIO read performance for
large (64KB) random AIO reads back to sane values (since the lazy readahead
backout in the mainline).
Andrew Morton [Fri, 12 Mar 2004 00:11:54 +0000 (16:11 -0800)]
[PATCH] readdir() cleanups
From: <viro@parcelfarce.linux.theplanet.co.uk>
cramfs and freevxfs explicitly mark themselves readonly (as other r/o fs
do).
afs marked noatime (ACKed by maintainer)
filesystems that do not do update_atime() in their ->readdir() had been
explicitly marked nodiratime. NOTE: cifs, coda and ncpfs almost certainly
need full noatime as we currently have in nfs and afs.
update_atime() call shifted to callers of ->readdir() and out of
->readdir() instances. Bugs caught:
dcache_readdir() updated atime only if it reached EOF.
Andrew Morton [Fri, 12 Mar 2004 00:11:41 +0000 (16:11 -0800)]
[PATCH] Clean up sys_ioperm stubs
From: Brian Gerst <bgerst@didntduck.org>
Remove stubs for sys_ioperm for non-x86 arches, using sys_ni_syscall
instead where applicable. Support for sys_ioperm is unconditionally no for
non-x86 arches.
Andrew Morton [Fri, 12 Mar 2004 00:11:31 +0000 (16:11 -0800)]
[PATCH] ppc64: fix initialisation of NUMA arrays
From: Anton Blanchard <anton@samba.org>
We were hitting problems on machines with cpu_possible != cpu_online when
NUMA was enabled. The debug checks would trip during scheduler init
because we iterate through all possible cpus whereas we only set up NUMA
information for online cpus.
Longer term we should have a cpu_up hook which sets up its NUMA information
but for now we initalise all possible cpus and memory to node 0.
Andrew Morton [Fri, 12 Mar 2004 00:11:20 +0000 (16:11 -0800)]
[PATCH] print kernel version in oops messages
From: Arjan van de Ven <arjanv@redhat.com>
Unfortunatly a large portion of the oops reports lack the basic
information about what kernel version the oops is for; it's trivial to just
print this in the oops as well to improve the usefulness of bugreports...
Anton Blanchard [Thu, 11 Mar 2004 23:57:24 +0000 (15:57 -0800)]
[PATCH] fix ppc64 in kernel syscalls
Thanks to some great debugging work by Olaf Hering and Marcus Meissner
it has been noticed that the current ppc64 syscall code is corrupting
4 bytes past errno. Why we even bothered to set errno beats me, its
unusable in the kernel.
Since we had to reinstate the inline syscall code we can go back to
using it for those few syscalls that we call. Especially now with
Randy's syscall prototype cleanup we should be calling them directly
but we can do that sometime later.
Jens Axboe [Thu, 11 Mar 2004 23:56:51 +0000 (15:56 -0800)]
[PATCH] CDROMREADAUDIO dma support
This small patch builds on top of the blk_rq_map_user() patch just sent,
and enables us to easily support DMA for CDROMREADAUDIO cdda extraction.
It's quite amazing how much cool stuff you can with the new block layer
:-)
Patch has intelligent fall back from multi frame dma to single frame
dma, and further to old-style pio ripping in case of hardware problems.
Jens Axboe [Thu, 11 Mar 2004 23:56:40 +0000 (15:56 -0800)]
[PATCH] user data -> request mapping
This patch allows you to map a request with user data for io, similarly
to what you can do with bio_map_user() already to a bio. However, this
goes one step further and populates the request so the user only has to
fill in the cdb (almost) and put it on the queue for execution. Patch
converts sg_io() to use it, next patch I'll send adapts cdrom layer to
use it for zero copy cdda dma extraction.