[R] Literature analysis

Ista Zahn istazahn at gmail.com
Fri Dec 11 16:06:37 CET 2009


I think my original response was bad. And I also realize that I
don't really understand what you want to do.

Here is what I was thinking:

Format your bibliography as CSV (you can convert common bibliography
formats to .csv using Tellico or similar software). Then read the
references into a data frame:

Refs <- read.csv(textConnection('Title,Entry Type,Author,Bibtex Key,Book Title,Editor,Organization,Publisher,Address,Edition,Pages,Year,ISBN#,Journal,DOI,Month,Number,How Published,Chapter,Series,Volume,Cross-Reference,Keywords,URL,Abstract,Notes,ID,Date Created,Date Modified
Unconscious authorship ascription: The effects of success and effect-specific information priming on experienced authorship,article,"Aarts, H.",Aarts2007,,,,,,,119–126,2007,,Journal of Experimental Social Psychology,,,,,,,43,,Lexical Decision; unconscious; Perception; presentation of words; effect-specific information; experienced authorship; authorship ascription; priming; lexical decisions; subliminal perception; Self Perception,http://localhost/refbase/Papers/Aarts/Unconscious%20authorship%20ascription:%20The%20effects%20of%20success.pdf,Abstract goes here,"exported from refbase (http://ista.scp.rochester.edu/refbase/show.php?record=14), last updated on Tue, 17 Nov 2009 10:23:27 -0500",0,2009-12-11,2009-12-11
Goal contagion: Perceiving is for pursuing,article,"Aarts, H.; Gollwitzer, P.; Hassin, R.",Aarts_etal2004,,,,,,,23–37,2004,,Journal of Personality and Social Psychology,,,,,,,87,,Contagion; goal contagion; automatic goal adion; Motivation; Perception; implied behavioral goals; goal perception; Theories; Automatism; behavioral information; goal directedness; Attention; Stimulus Parameters; Goals; goal characteristics,http://localhost/refbase/Papers/Aarts/Goal%20contagion:%20Perceiving%20is%20for%20pursuing.pdf,Abstract goes here.,"exported from refbase (http://ista.scp.rochester.edu/refbase/show.php?record=15), last updated on Tue, 17 Nov 2009 10:23:27 -0500",1,2009-12-11,2009-12-11'))

closeAllConnections()

(sorry, I know that's going to get all screwed up by word wrapping).
Next split out the keywords into separate columns, and then combine
them again into a value column:

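library(ggplot2)
library(reshape)  # colsplit() and melt() come from the reshape package
Refs <- as.data.frame(cbind(Refs, colsplit(Refs$Keywords, split="; ",
names="Keyword")))
# measure.vars should index the keyword columns added by colsplit();
# adjust 31:45 if your column count differs
Refs <- melt(Refs, measure.vars=31:45)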
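
If colsplit() and melt() are not available (as far as I know, newer ggplot2 releases no longer attach reshape), the same split into one row per keyword can be sketched in base R. The toy data frame below is made up for illustration; only the column names Bibtex.Key and Keywords follow what read.csv() would produce from the CSV above, and RefsLong is a hypothetical name.

```r
# Toy stand-in for Refs (hypothetical, shortened keyword lists)
Refs <- data.frame(
  Bibtex.Key = c("Aarts2007", "Aarts_etal2004"),
  Keywords   = c("priming; Perception; unconscious",
                 "Motivation; Perception; Goals"),
  stringsAsFactors = FALSE
)

# Split each keyword string on "; " into a list of character vectors
kw <- strsplit(Refs$Keywords, "; ", fixed = TRUE)

# Build one row per (paper, keyword) pair
RefsLong <- data.frame(
  Bibtex.Key = rep(Refs$Bibtex.Key, times = lengths(kw)),
  Keyword    = unlist(kw),
  stringsAsFactors = FALSE
)
```

This keeps only the paper key and the keyword; any other columns you need (Author, Year) could be repeated the same way with rep().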

And then that's where I get stuck. I have authors and keywords for each
publication, but I'm not sure how you want this represented.
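
One possible representation, offered only as a sketch of the two things asked for (which paper each keyword occurs in, and which keywords occur together), is a keyword-by-paper table. The data below are a made-up toy example, not the real references, and the names long, inc, keywordCounts and cooccur are hypothetical.

```r
# Toy long-format data: one row per (paper, keyword) pair (hypothetical values)
long <- data.frame(
  Paper   = c("Aarts2007", "Aarts2007", "Aarts2007",
              "Aarts_etal2004", "Aarts_etal2004", "Aarts_etal2004"),
  Keyword = c("priming", "Perception", "unconscious",
              "Motivation", "Perception", "Goals"),
  stringsAsFactors = FALSE
)

# 1) In which paper does each keyword occur, and how often?
#    Rows are keywords, columns are papers.
inc <- unclass(table(long$Keyword, long$Paper))
keywordCounts <- rowSums(inc)   # total occurrences per keyword

# 2) Which keywords occur together? Assuming each keyword is listed at most
#    once per paper, entry [i, j] counts the papers containing both keyword i
#    and keyword j (the diagonal is each keyword's own paper count).
cooccur <- tcrossprod(inc)

# Base-graphics dot plot: papers on the x axis, keywords on the y axis
px <- factor(long$Paper)
ky <- factor(long$Keyword)
plot(as.integer(px), as.integer(ky), type = "p", pch = 19,
     xaxt = "n", yaxt = "n", xlab = "Paper", ylab = "Keyword")
axis(1, at = seq_along(levels(px)), labels = levels(px))
axis(2, at = seq_along(levels(ky)), labels = levels(ky), las = 1)
```

Point colour or symbol could then be mapped to keywordCounts or to rows of cooccur, but how best to do that depends on what you actually want to see.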

-Ista

On Fri, Dec 11, 2009 at 9:04 AM, Schwan <s.s.hosseiny at utwente.nl> wrote:
> Thanks, but how should I put the citation inside a data frame?
>
> data.frame(first txt file, second txt file...)
> plot (what should I insert here????) type="p"
>
> And how should I load the txt files anyway inside the frame?
>
>
>
>
>
>
> On Fri, 2009-12-11 at 08:37 -0500, Ista Zahn wrote:
>> It sounds pretty simple so far. Just put the citation info in a data
>> frame, and plot it. I would use ggplot2 for plotting, but it could be
>> done in base or lattice too.
>>
>> On Fri, Dec 11, 2009 at 8:04 AM, Schwan <s.s.hosseiny at utwente.nl> wrote:
>> > OK, good question, I haven't explained!
>> >
>> > Well, let's keep it simple for the beginning. By analyzing the
>> > keywords (which can also include the authors' names) I mean,
>> >
>> > 1) investigate in which paper each keyword occurs and how often
>> > 2) investigate whether keywords occur together
>> >
>> > The x axis can show the paper title (or author's name)
>> > the y axis shows the keywords
>> >
>> > if a keyword occurs more often in a paper then the dot on the graph
>> > changes color
>> >
>> > if keywords occur together then the same symbol appears.
>> >
>> >
>> >
>> >
>> >
>> >
>> >
>> > On Fri, 2009-12-11 at 04:33 -0800, nshephard wrote:
>> >>
>> >>
>> >> Schwan wrote:
>> >> >
>> >> > Thanks for all the comments,
>> >> >
>> >> > and sorry about the unstructured question! I am trying to:
>> >> >
>> >> > 1: analyze keywords, author names and year of publication from
>> >> > citations (with abstracts) I downloaded from various sites (these
>> >> > downloads can be converted into ".txt" files as well)
>> >>
>> >> Sorry to bang the drum, but what do you mean by "analyse"? Is it to simply
>> >> count keywords and rank them, investigate which keywords occur together,
>> >> look at trends in keywords by year, or something else completely?
>> >>
>> >> It's a very general term (at least for me, as I've no knowledge or experience
>> >> of text mining)!
>> >>
>> >> Neil
>> >
>> >
>> > --
>> > -------------------------------------------
>> > Hosseiny, MSc. S.S. (Seyed Schwan)
>> > University of Twente Science and Technology
>> > Meander, ME 322
>> > P.O. Box 217 7500 AE Enschede
>> > The Netherlands
>> > Phone +31 534892869
>> > Email:  S.S.Hosseiny at utwente.n
>> >
>> > ______________________________________________
>> > R-help at r-project.org mailing list
>> > https://stat.ethz.ch/mailman/listinfo/r-help
>> > PLEASE do read the posting guide http://www.R-project.org/posting-guide.html
>> > and provide commented, minimal, self-contained, reproducible code.
>> >
>>
>>
>>



-- 
Ista Zahn
Graduate student
University of Rochester
Department of Clinical and Social Psychology
http://yourpsyche.org



