The linguistics of authorship recognition

a_b_desert_king a_b_desert_king at hotmail.com
Sun Dec 26 01:51:41 UTC 2004


No: HPFGUIDX 120609


> snip

> 4. There are some recognized ways to do authorship recognition 
> studies.  In my own graduate work I use my faculty advisor's 
> Vocabulary Management Profiles program to check for word usage 
> (hapax legmena, type/token ratio, etc).  The website is an 
academic 
> one (that is, not for profit) and is:
> www.missouri.edu/~youmansc/vmp/
> You might consider getting some mooseming text, then finding a bit 
> of comparable wording from JKR, say, transcript from a chat or 
> interview, then comparing relevant statistics.  

snip

> 
> Merry Christmas! Happy Holidays!
> Brian Brinkman

Hi Brian,

Since I have nada to do this Christmas Day, I decided to give your 
site a try.  I'm not sure I'm doing it right tho...

I found all of mooseming's posts that I could (very interesting read 
for anyone who would like to see them) and took some of her text 
from her website Rumours and News sections.  Now, this was what I 
got:

VMP2.2: "mooseming files.txt" Interval: 55 
TotalTypes=857  TotalTokens=2413  Types/Tokens=0.3552
AvgR = the ratio of Types / Tokens over the moving interval.
           

Mean avgR = 0.35530
Standard Deviation = 0.06603
Fractal Dimension = 1.480432  

If I did that right, that says to me that there is a pretty close 
match....  But I'm not sure I did it right....

Anyway, if you want to try it yourself I have the files here - just 
e-mail me and I'll happily send it to you to try.  I'm really 
curious now....

Heather - Merry Christmas to you too!







More information about the HPforGrownups archive