Skip to content

Encoding Issue for GoogleNewsSource, RCurl #11

@mannau

Description

@mannau

when I try to modify parameters such as edition, country, or language, I get a number of errors like the following:

googln <- WebCorpus(GoogleNewsSource('Μπαλτάς',params=list(language='gr'))

Error: 1: EntityRef: expecting ';'
2: EntityRef: expecting ';'
3: EntityRef: expecting ';'
4: EntityRef: expecting ';'
5: EntityRef: expecting ';'
6: EntityRef: expecting ';'
7: xmlParseEntityRef: no name
8: xmlParseEntityRef: no name
9: error parsing attribute name
10: attributes construct error
11: Couldn't find end of Start Tag c line 1
> sessionInfo()
R version 3.1.2 (2014-10-31)
Platform: x86_64-w64-mingw32/x64 (64-bit)

locale:
[1] LC_COLLATE=Greek_Greece.1253  LC_CTYPE=Greek_Greece.1253   
[3] LC_MONETARY=Greek_Greece.1253 LC_NUMERIC=C                 
[5] LC_TIME=Greek_Greece.1253    

attached base packages:
[1] stats     graphics  grDevices utils     datasets  methods   base     

other attached packages:
 [1] tm.plugin.webmining_1.3 tm_0.6                  NLP_0.1-6              
 [4] RDSTK_1.1               RCurl_1.95-4.5          bitops_1.0-6           
 [7] rjson_0.2.15            xts_0.9-7               zoo_1.7-11             
[10] lubridate_1.3.3         stringr_1.0.0           plyr_1.8.3             
[13] XML_3.98-1.1           

loaded via a namespace (and not attached):
 [1] boilerpipeR_1.3 digest_0.6.8    grid_3.1.2      lattice_0.20-29 magrittr_1.5   
 [6] memoise_0.2.1   parallel_3.1.2  Rcpp_0.12.1     rJava_0.9-6     RJSONIO_1.3-0  
[11] slam_0.1-32     stringi_0.5-5   tools_3.1.2    

Metadata

Metadata

Assignees

No one assigned

    Labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions