Message-ID: <53CE1B3D.7070304@amd.com>
Date: Tue, 22 Jul 2014 11:05:17 +0300
From: Oded Gabbay
Subject: Re: [PATCH v2 00/25] AMDKFD kernel driver
To: Jerome Glisse
Cc: Andrew Lewycky, linux-mm, Michel Dänzer, linux-kernel@vger.kernel.org, dri-devel@lists.freedesktop.org, Alexey Skidanov, Andrew Morton, Dave Airlie, "Bridgman, John", "Deucher, Alexander", Joerg Roedel, Ben Goz, Christian König, Daniel Vetter, "Sellek, Tom"

On 22/07/14 02:05, Jerome Glisse wrote:
> On Tue, Jul 22, 2014 at 12:56:13AM +0300, Oded Gabbay wrote:
>> On 21/07/14 22:28, Jerome Glisse wrote:
>>> On Mon, Jul 21, 2014 at 10:23:43PM +0300, Oded Gabbay wrote:
>>>> On 21/07/14 21:59, Jerome Glisse wrote:
>>>>> On Mon, Jul 21, 2014 at 09:36:44PM +0300, Oded Gabbay wrote:
>>>>>> On 21/07/14 21:14, Jerome Glisse wrote:
>>>>>>> On Mon, Jul 21, 2014 at 08:42:58PM +0300, Oded Gabbay wrote:
>>>>>>>> On 21/07/14 18:54, Jerome Glisse wrote:
>>>>>>>>> On Mon, Jul 21, 2014 at 05:12:06PM +0300, Oded Gabbay wrote:
>>>>>>>>>> On 21/07/14 16:39, Christian König wrote:
>>>>>>>>>>> On 21.07.2014 14:36, Oded Gabbay wrote:
>>>>>>>>>>>> On 20/07/14 20:46, Jerome Glisse wrote:
>>>>>>>>>>>>> On Thu, Jul 17, 2014 at 04:57:25PM +0300, Oded Gabbay wrote:
>>>>>>>>>>>>>> Forgot to cc mailing list on cover letter. Sorry.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> As a continuation of the existing discussion, here is a v2 patch series restructured with a cleaner history and no totally-different-early-versions of the code.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Instead of 83 patches, there are now a total of 25 patches, where 5 of them are modifications to the radeon driver and 18 of them contain only amdkfd code. No code is removed or even modified between patches; code is only added.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> The driver was renamed from radeon_kfd to amdkfd and moved to reside under drm/radeon/amdkfd. This move was done to emphasize the fact that this driver is an AMD-only driver at this point. Having said that, we do foresee a generic hsa framework being implemented in the future and, in that case, we will adjust amdkfd to work within that framework.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> As the amdkfd driver should support multiple AMD gfx drivers, we want to keep it as a separate driver from radeon. Therefore, the amdkfd code is contained in its own folder. The amdkfd folder was put under the radeon folder because the only AMD gfx driver in the Linux kernel at this point is the radeon driver. Having said that, we will probably need to move it (maybe to be directly under drm) after we integrate with additional AMD gfx drivers.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> For people who like to review using git, the v2 patch set is located at:
>>>>>>>>>>>>>> http://cgit.freedesktop.org/~gabbayo/linux/log/?h=kfd-next-3.17-v2
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Written by Oded Gabbay
>>>>>>>>>>>>>
>>>>>>>>>>>>> So, quick comments before I finish going over all patches. There are many things that need more documentation, especially as right now there is no userspace I can go look at.
>>>>>>>>>>>> So, quick comments on some of your questions, but first of all, thanks for the time you dedicated to reviewing the code.
>>>>>>>>>>>>>
>>>>>>>>>>>>> There are a few show stoppers; the biggest one is gpu memory pinning. This is a big no that would need serious arguments for any hope of convincing me on that side.
>>>>>>>>>>>> We only do gpu memory pinning for kernel objects. There are no userspace objects that are pinned in gpu memory in our driver. If that is the case, is it still a show stopper?
>>>>>>>>>>>>
>>>>>>>>>>>> The kernel objects are:
>>>>>>>>>>>> - pipelines (4 per device)
>>>>>>>>>>>> - mqd per hiq (only 1 per device)
>>>>>>>>>>>> - mqd per userspace queue. On KV, we support up to 1K queues per process, for a total of 512K queues. Each mqd is 151 bytes, but the allocation is done with 256-byte alignment, so the total *possible* memory is 512K x 256 B = 128MB.
>>>>>>>>>>>> - kernel queue (only 1 per device)
>>>>>>>>>>>> - fence address for kernel queue
>>>>>>>>>>>> - runlists for the CP (1 or 2 per device)
>>>>>>>>>>>
>>>>>>>>>>> The main questions here are whether it is avoidable to pin down the memory, and whether the memory is pinned down at driver load, by request from userspace, or by anything else.
>>>>>>>>>>>
>>>>>>>>>>> As far as I can see, only the "mqd per userspace queue" might be a bit questionable; everything else sounds reasonable.
>>>>>>>>>>>
>>>>>>>>>>> Christian.
>>>>>>>>>>
>>>>>>>>>> Most of the pin downs are done on device initialization.
>>>>>>>>>> The "mqd per userspace queue" pinning is done per userspace queue creation. However, as I said, it has an upper limit of 128MB on KV, and considering the 2G local memory, I think it is OK.
>>>>>>>>>> The runlists are also pinned on userspace queue creation/deletion, but we only have 1 or 2 runlists per device, so it is not that bad.
>>>>>>>>>
>>>>>>>>> 2G local memory? You can not assume anything about the user-side configuration; someone might build an hsa computer with 512M and still expect a functioning desktop.
>>>>>>>> First of all, I'm only considering a Kaveri computer, not an "hsa" computer. Second, I would imagine we can build some protection around it, like checking total local memory and limiting the number of queues based on some percentage of that total local memory (something like the sketch below).
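[Editor's aside: a minimal sketch of that kind of protection. The 1/16 budget, the helper name, and the constants are assumptions for illustration, not code from the patch set:

#include <linux/kernel.h>       /* min_t() */
#include <linux/types.h>

/* Sketch: bound the total number of user queues by a fixed fraction
 * of local memory, so that pinned mqds cannot exhaust VRAM. */
#define MQD_ALLOC_SIZE 256      /* 151-byte mqd, allocated 256-byte aligned */

static u32 kfd_max_user_queues(u64 local_mem_bytes)
{
        u64 budget = local_mem_bytes / 16;      /* ~6% of local memory */

        return min_t(u64, budget / MQD_ALLOC_SIZE, 512 * 1024);
}

With 2G of local memory the budget is 128MB, i.e. the full 512K queues; with 512M it drops to 128K queues.]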
>>>>>>>> So, if someone has only 512M, he will be able to open fewer queues.
>>>>>>>>
>>>>>>>>
>>>>>>>>>
>>>>>>>>> I need to go look into what all this mqd stuff is for, what it does and what it is about. But pinning is really bad, and this is an issue with userspace command scheduling, an issue that AMD obviously failed to take into account in the design phase.
>>>>>>>> Maybe, but that is the H/W design nonetheless. We can't very well change the H/W.
>>>>>>>
>>>>>>> You can not change the hardware, but that is not an excuse to let bad design sneak into the software to work around it. So I would rather penalize bad hardware design and have command submission in the kernel, until AMD fixes its hardware to allow proper scheduling by the kernel and proper control by the kernel.
>>>>>> I'm sorry but I do *not* think this is a bad design. S/W scheduling in the kernel can not, IMO, scale well to 100K queues and 10K processes.
>>>>>
>>>>> I am not advocating for having the kernel decide down to the very last detail. I am advocating for the kernel being able to preempt at any time and being able to decrease or increase user queue priority, so that overall the kernel is in charge of resource management and can handle a rogue client in proper fashion.
>>>>>
>>>>>>
>>>>>>> Because really where we want to go is having the GPU closer to a CPU in terms of scheduling capacity, and once we get there, we want the kernel to always be able to take over and do whatever it wants behind the process's back.
>>>>>> Who do you refer to when you say "we"? AFAIK, the hw scheduling direction is where AMD is now and where it is heading in the future. That doesn't preclude the option of allowing the kernel to take over and do what it wants. I agree that in KV we have a problem where we can't do a mid-wave preemption, so theoretically a long-running compute kernel can make things messy, but in Carrizo we will have this ability. Having said that, it will only be through the CP H/W scheduling. So AMD is _not_ going to abandon H/W scheduling. You can dislike it, but this is the situation.
>>>>>
>>>>> "We" was the overall Linux community, but maybe I should not pretend to talk for anyone interested in having a common standard.
>>>>>
>>>>> My point is that current hardware does not have appropriate hardware support for preemption; hence, current hardware should use an ioctl to schedule jobs, and AMD should think a bit more before committing to a design and handwaving away any hardware shortcoming as something that can be worked around in software. The pinning thing is broken by design; the only way to work around it is through kernel cmd queue scheduling. That's a fact.
>>>>
>>>>>
>>>>> Once hardware supports proper preemption and allows buffers used on behalf of a userspace command queue to be moved around/evicted, then we can allow userspace scheduling. But until then, my personal opinion is that it should not be allowed, and that people will have to pay the ioctl price, which I proved to be small, because really, if you have 100K queues, each with one job, I would not expect all those 100K jobs to complete in less time than it takes to execute an ioctl, ie even without the ioctl delay, whatever you schedule will have to wait on previously submitted jobs.
>>>>
>>>> But Jerome, the core problem still remains in effect, even with your suggestion. If an application, either via a userspace queue or via ioctl, submits a long-running kernel, then the CPU in general can't stop the GPU from running it. And if that kernel does while(1); then that's it, game over, no matter how you submitted the work. So I don't really see the big advantage in your proposal. Only in CZ can we stop this wave (and only through CP H/W scheduling). What you are saying is basically: don't allow people to use compute on a Linux KV system because it _may_ get the system stuck.
>>>>
>>>> So even if I really wanted to, and I may agree with you theoretically on that, I can't fulfill your desire to make the "kernel being able to preempt at any time and be able to decrease or increase user queue priority so overall kernel is in charge of resources management and it can handle rogue client in proper fashion". Not in KV, and I guess not in CZ as well.
>>>>
>>>> Oded
>>>
>>> I do understand that, but using a kernel ioctl provides the same kind of control as we have now, ie we can bind/unbind buffers on a per command buffer submission basis, just like with the current graphics or compute stuff.
>>>
>>> Yes, the current graphics and compute stuff can launch a while(1) and never return, and yes, currently we have nothing against that, but we should, and the solution would be simple: just kill the gpu thread.
>>>
>> OK, so in that case, the kernel can simply unmap all the queues by writing an UNMAP_QUEUES packet to the HIQ. Even if the queues are userspace queues, they will not be mapped to the internal CP scheduler.
>> Does that satisfy the kernel control level you want?
>
> This raises questions: what happens to currently running threads when you unmap a queue? Do they keep running until done? If not, then this will break user applications, and that is not an acceptable solution.

They keep running until they are done. However, their subsequent submissions of workloads to their queues have no effect, of course.

Maybe I should explain how this works from the userspace POV (a rough code sketch follows below). When the userspace app wants to submit work to the queue, it writes to 2 different locations: the doorbell and a wptr shadow (which is in system memory, viewable by the GPU). Every write to the doorbell triggers the CP (and other stuff) in the GPU. The CP then checks if the doorbell's queue is mapped. If so, it handles the write. If not, it simply ignores it.

So, when we unmap queues, the CP will ignore the doorbell writes by the userspace app; however, the app will not know that (unless it specifically waits for results). When the queue is re-mapped, the CP will take the wptr shadow and use it to re-synchronize itself with the queue.

>
> Otherwise, infrastructure inside radeon would be needed to force this queue unmap on bo_pin failure, so gfx pinning can be retried.

If we fail to bo_pin, then we of course unmap the queue and return -ENOMEM.

I would like to add another piece of relevant information here. I checked the code again, and the "mqd per userspace queue" allocation is done only in RADEON_GEM_DOMAIN_GTT, which AFAIK is *system memory* that is also mapped (and pinned) in the GART address space. Does that still count as GPU memory from your POV? Are you really concerned about GART address space being exhausted?
Moreover, in all of our code, I don't see us using RADEON_GEM_DOMAIN_VRAM. We have a function in radeon_kfd.c called pool_to_domain, and you can see there that we map KGD_POOL_FRAMEBUFFER to RADEON_GEM_DOMAIN_VRAM. However, if you search for KGD_POOL_FRAMEBUFFER, you will see that we don't use it anywhere.

>
> Also how do you cope with doorbell exhaustion? Do you just plan to error out? In which case this is another DDOS vector, but one only affecting the gpu.

Yes, we plan to error out, but I don't see how we can defend against that. For a single process, we limit the queues to 1K, as we assign 1 doorbell page per process and each doorbell is 4 bytes (see the arithmetic sketched further below). However, if someone were to fork a lot of processes, each of them registering and opening 1K queues, that would be a problem. But how can we recognize such an event and differentiate it from normal operation? Did you have something specific in mind?

>
> And there are many other questions that need answers, like my kernel memory map question, because as of right now I assume that kfd allows any thread on the gpu to access any kernel memory.

Actually, no. We don't allow any access from gpu kernels to Linux kernel memory.

Let me explain more. In KV, the GPU is responsible for telling the IOMMU whether an access is privileged or not. If the access is privileged, then the IOMMU can allow the GPU to access kernel memory. However, we never configure the GPU in our driver to issue privileged accesses. In CZ, this is solved by configuring the IOMMU to not allow privileged accesses.

>
> Other things are how ill-formatted packets are handled by the hardware. I do not see any mechanism to deal with SIGBUS or SIGFAULT.

You are correct when you say you don't see any mechanism. We are now developing it :) Basically, there will be two new modules. The first one is the event module, which is already written and working. The second is the exception handling module, which is now being developed and will be built upon the event module. The exception handling module will take care of ill-formatted packets and other exceptions from the GPU (those that are not handled by radeon).

>
>
> Also, it is a worrisome prospect to see resource management completely ignored for future AMD hardware. The kernel exists for a reason! The kernel's main purpose is to provide resource management; if AMD fails to understand that, this is not looking good long term, and I expect none of the HSA technology will get momentum, and I would certainly advocate against any use of it inside products I work on.
>

So I made a mistake in writing: "Not in KV, and I guess not in CZ as well", and I apologize for misleading you.
What I needed to write was:
"In KV, as a first generation HSA APU, we have limited ability to allow the kernel to preempt at any time and control user queue priority. However, in CZ we have dramatically improved control and resource management capabilities that will allow the kernel to preempt at any time and also control user queue priority."

So, as you can see, AMD fully understands that the kernel's main purpose is to provide resource management, and I hope this will make you recommend AMD H/W now and in the future.

Oded

> Cheers,
> Jérôme

>> Oded

>>>>>>>>>>>>>
>>>>>>>>>>>>> It might be better to add a drivers/gpu/drm/amd directory and add common stuff there.
>>>>>>>>>>>>>
>>>>>>>>>>>>> Given that this is not intended to be the final HSA api, AFAICT, I would say it is far better to avoid the whole kfd module and add ioctls to radeon. This would avoid crazy communication between radeon and kfd.
>>>>>>>>>>>>>
>>>>>>>>>>>>> The whole aperture business needs some serious explanation. Especially as you want to use userspace addresses, there is nothing to prevent a userspace program from allocating things at the addresses you reserve for lds, scratch, ... The only sane way would be to move those lds, scratch apertures inside the virtual address range reserved for the kernel (see kernel memory map).
>>>>>>>>>>>>>
>>>>>>>>>>>>> The whole business of locking performance counters for exclusive per-process access is a big NO. Which leads me to the questionable usefulness of userspace command rings.
>>>>>>>>>>>> That's like saying: "Which leads me to the questionable usefulness of HSA". I find it analogous to a situation where a network maintainer nacks a driver for a network card which is slower than a different network card. It doesn't seem reasonable that such a situation would happen. He would still put both drivers in the kernel because people want to use the H/W and its features. So, I don't think this is a valid reason to NACK the driver.
>>>>>>>>>
>>>>>>>>> Let me rephrase: drop the performance counter ioctl, and modulo memory pinning, I see no objection. In other words, I am not NACKING the whole patchset; I am NACKING the performance ioctl.
>>>>>>>>>
>>>>>>>>> Again, this is another argument for a round trip to the kernel, as inside the kernel you could properly do exclusive gpu counter access across a single user cmd buffer execution.
>>>>>>>>>
>>>>>>>>>>>>
>>>>>>>>>>>>> I only see issues with that. First and foremost, I would need to see solid figures showing that a kernel ioctl or syscall has a higher overhead that is measurable in any meaningful way against a simple function call. I know the userspace command ring is a big marketing feature that pleases ignorant userspace programmers. But really this only brings issues and absolutely no upside, afaict.
>>>>>>>>>>>> Really? You think that doing a context switch to kernel space, with all its overhead, is _not_ more expensive than just calling a function in userspace which only puts a buffer on a ring and writes a doorbell?
>>>>>>>>>
>>>>>>>>> I am saying the overhead is not that big, and it probably will not matter in most use cases. For instance, I wrote the most useless kernel module, which adds two numbers through an ioctl (http://people.freedesktop.org/~glisse/adder.tar); it takes ~0.35 microseconds with the ioctl while the plain function takes ~0.025 microseconds, so the ioctl is 13 times slower.
>>>>>>>>>
>>>>>>>>> Now if there is enough data showing that a significant percentage of jobs submitted to the GPU will take less than 0.35 microseconds, then yes, userspace scheduling does make sense. But so far all we have is handwaving with no data to support any facts.
>>>>>>>>>
>>>>>>>>> Now if we want to schedule from userspace, then you will need to do something about the pinning, something that gives control to the kernel so that the kernel can unpin when it wants and move objects when it wants, no matter what userspace is doing.

--
To unsubscribe, send a message with 'unsubscribe linux-mm' in
the body to majordomo@kvack.org. For more info on Linux MM,
see: http://www.linux-mm.org/ .
Don't email: email@kvack.org