| Title: | Text & Scientometric Analytics |
|---|---|
| Description: | Calculates bibliometric indicators from bibliometric data and performs pattern analysis on the text fields of that data. The bibliometric data are obtained mainly from Web of Science and Scopus. |
| Authors: | Ashraf Uddin |
| Maintainer: | Ashraf Uddin <[email protected]> |
| License: | GPL-3 |
| Version: | 0.1 |
| Built: | 2026-05-15 06:33:42 UTC |
| Source: | https://github.com/cran/scientoText |
Finds the top authors and their performance indicators.
author_info(authors, citations, sep, top = 10, only_first_author = F)

| authors | A character vector containing author names |
|---|---|
| citations | A numeric vector containing citations |
| sep | A character pattern separating author names |
| top | The number of top authors |
| only_first_author | Logical. Whether to consider first authors only |

A list containing author names, total instances, total citations, h index, g index, i10 index, and maximum citation count.
authors <- c("Wolf W.R., Lele S.K.",
             "Shin D., Yeh X., Khatib O.",
             "Aukes D., Heyneman B., Duchaine V., Cutkosky M.R.")
author_info(authors, c(3, 4, 1), ',')
Finds the year-wise co-authorship matrix and average co-authorship values.
authorship_pattern(authors, pub_years, sep)

| authors | A character vector containing author names |
|---|---|
| pub_years | A numeric vector containing publication years |
| sep | A character pattern separating author names |

A list with co-authorship matrix and average co-authorship values
authors <- c("Wolf W.R., Lele S.K.",
             "Shin D., Yeh X., Khatib O.",
             "Aukes D., Heyneman B., Duchaine V., Cutkosky M.R.")
years <- c(2011, 2012, 2012)
authorship_pattern(authors, years, ',')
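
To make the idea of a year-wise authorship pattern concrete, here is a minimal base R sketch. It is not the package's implementation: it assumes the pattern is a cross-tabulation of publication year against the number of authors per record, and the names `n_authors`, `pattern_matrix`, and `avg_authors` are hypothetical.

```r
# Illustrative sketch only; the package may define its matrix differently.
authors <- c("Wolf W.R., Lele S.K.",
             "Shin D., Yeh X., Khatib O.",
             "Aukes D., Heyneman B., Duchaine V., Cutkosky M.R.")
years <- c(2011, 2012, 2012)

# Number of authors in each record, splitting on the separator ","
n_authors <- lengths(strsplit(authors, ",", fixed = TRUE))

# Assumed layout: rows are years, columns are author counts
pattern_matrix <- table(year = years, n_authors = n_authors)

# Average number of authors per record in each year
avg_authors <- tapply(n_authors, years, mean)

print(pattern_matrix)
print(avg_authors)
```

With this input the records have 2, 3, and 4 authors, so the 2012 average is 3.5.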
Citations and Cited Instances
citation_info(citations, pub_years)

| citations | A numeric vector containing citations |
|---|---|
| pub_years | A numeric vector containing publication years |

Returns the year-wise total instances (tp), cited instances, and total citations (tc).
citation_info(c(1, 3, 0, 4, 2, 3, 1, 0), c(2012, 2012, 2012, 2013, 2012, 2011, 2014, 2014))
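
The three year-wise quantities can be reproduced by hand in base R, which may help when checking results. This is a sketch rather than the package's code; it assumes a "cited instance" is a record with at least one citation, and `year_summary` is a hypothetical name.

```r
# Illustrative sketch only; assumes "cited" means citations > 0.
citations <- c(1, 3, 0, 4, 2, 3, 1, 0)
pub_years <- c(2012, 2012, 2012, 2013, 2012, 2011, 2014, 2014)

tp    <- tapply(citations, pub_years, length)                  # records per year
cited <- tapply(citations, pub_years, function(x) sum(x > 0))  # records cited at least once
tc    <- tapply(citations, pub_years, sum)                     # total citations per year

year_summary <- data.frame(year  = as.numeric(names(tp)),
                           tp    = as.vector(tp),
                           cited = as.vector(cited),
                           tc    = as.vector(tc))
print(year_summary)
```

For 2012, for example, the four records have citations 1, 3, 0, and 2, giving tp = 4, cited = 3, and tc = 6.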
Country-wise and year-wise output for a defined period.
country_pattern(affiliations, pub_years = NULL, countries = NULL, only_first_author = F)

| affiliations | A text vector containing affiliation (country) information |
|---|---|
| pub_years | A numeric vector containing publication years |
| countries | A list of countries (optional) |
| only_first_author | Logical. Whether to consider first authors only |

The function returns a year-wise and country-wise output matrix if publication years are provided. If only affiliation data is provided, the country-wise output is returned as a single vector instead of a matrix.
A list containing country output and other details.
affiliations <- c("Stanford University, Stanford, CA, United States; Montreal, QC, Canada",
                  "Stanford University, United States; Google Inc., United States",
                  "University of Michigan, Ann Arbor, MI 48109-2122, United States; Tsinghua University, Beijing 100084, China",
                  "Imperial College London, London, SW7 2BZ, United Kingdom; ENSTA, Ecole Polytechnique, Palaiseau, 91761, France")
pub_years <- c(2012, 2012, 2013, 2014)
country_pattern(affiliations, pub_years)
country_pattern(affiliations)
Find h index for a given set of documents
h_index(citations)

| citations | A numeric vector containing citations |
|---|---|

Returns the h index for the given citations.
Hirsch, J. E. (2005). An index to quantify an individual's scientific research output. Proceedings of the National Academy of Sciences of the United States of America, 102(46), 16569-16572.
h_index(c(1, 2, 5, 0, 3, 11))
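
For reference, Hirsch's definition (the largest h such that h documents have at least h citations each) can be written in a few lines of base R. The function name `h_index_sketch` is hypothetical, and this is a sketch of the definition, not necessarily how the package computes it.

```r
# Illustrative sketch of the h index definition.
h_index_sketch <- function(citations) {
  # Rank documents from most to least cited
  sorted <- sort(citations, decreasing = TRUE)
  # Count ranks whose citation count is at least the rank itself;
  # the condition is monotone in rank, so the count equals the h index
  sum(sorted >= seq_along(sorted))
}

h_value <- h_index_sketch(c(1, 2, 5, 0, 3, 11))
print(h_value)  # 3
```

Sorted in decreasing order the citations are 11, 5, 3, 2, 1, 0. The first three satisfy "citations >= rank" and the fourth (2 citations at rank 4) does not, so h = 3.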
It finds the number of highly cited instances year-wise.
highly_cited(citations, pub_years, ref_citations = NULL, ref_pub_years = NULL, top = NULL, year_lim = list())

| citations | A numeric vector containing citations |
|---|---|
| pub_years | A numeric vector containing publication years |
| ref_citations | The citations of reference instances |
| ref_pub_years | The publication years of reference instances |
| top | An integer defining the top percentage of highly cited instances |
| year_lim | A list containing years and year-wise citation thresholds. If not supplied, these values are calculated from ref_citations, ref_pub_years, and top. |

Returns a list containing number of top highly cited instances with other details
citations <- c(2, 0, 12, 3, 1, 1, 4, 5, 8, 2)
pub_years <- c(2011, 2011, 2012, 2011, 2013, 2011, 2011, 2012, 2011, 2013)
ref_citations <- c(3, 0, 12, 3, 1, 1, 41, 5, 8, 2, 2, 0, 12, 30, 1, 1, 4, 5, 8, 12)
ref_pub_years <- c(2012, 2011, 2012, 2013, 2013, 2011, 2011, 2012, 2011, 2013,
                   2011, 2011, 2012, 2011, 2013, 2011, 2011, 2012, 2011, 2013)
highly_cited(citations, pub_years, ref_citations, ref_pub_years, 10)
highly_cited(citations, pub_years, year_lim = list(c(2011, 2012, 2013), c(41, 12, 12)))
Calculate the number of Internationally Collaborated Papers
international_col(affiliations, pub_years = NULL, countries = NULL)

| affiliations | A text vector containing affiliation (country) information |
|---|---|
| pub_years | A numeric vector containing publication years |
| countries | A list of countries (optional) |

Detects whether each record involves international collaboration, so the affiliation fields must contain country information.
Collaboration count or a list (collaboration counts year-wise)
affiliations <- c("Stanford University, Stanford, CA, United States; Montreal, QC, Canada",
                  "Stanford University, United States; Google Inc., United States",
                  "University of Michigan, Ann Arbor, MI 48109-2122, United States; Tsinghua University, Beijing 100084, China",
                  "Imperial College London, London, SW7 2BZ, United Kingdom; ENSTA, Ecole Polytechnique, Palaiseau, 91761, France")
pub_years <- c(2012, 2012, 2013, 2014)
international_col(affiliations, pub_years)
international_col(affiliations)
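
The requirement for country information can be illustrated with a small base R sketch that flags records listing more than one distinct country. It is not the package's method: it assumes affiliations within a record are separated by ";" and that the country is the text after the last comma of each affiliation. The helper `last_field` and the other variable names are hypothetical.

```r
# Illustrative sketch only; relies on the two formatting assumptions above.
affiliations <- c(
  "Stanford University, Stanford, CA, United States; Montreal, QC, Canada",
  "Stanford University, United States; Google Inc., United States",
  paste0("University of Michigan, Ann Arbor, MI 48109-2122, United States; ",
         "Tsinghua University, Beijing 100084, China"),
  paste0("Imperial College London, London, SW7 2BZ, United Kingdom; ",
         "ENSTA, Ecole Polytechnique, Palaiseau, 91761, France"))
pub_years <- c(2012, 2012, 2013, 2014)

# Keep only the text after the last comma of an affiliation string
last_field <- function(x) trimws(sub(".*,", "", x))

# Distinct countries named in each record
countries_per_record <- lapply(strsplit(affiliations, ";", fixed = TRUE),
                               function(parts) unique(last_field(parts)))

# A record counts as internationally collaborated if it names 2+ countries
is_international <- vapply(countries_per_record,
                           function(x) length(x) > 1, logical(1))

total_icp <- sum(is_international)                     # 3
icp_by_year <- tapply(is_international, pub_years, sum)
print(total_icp)
print(icp_by_year)
```

Here the second record names only the United States, so three of the four records are flagged under these assumptions.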
Calculate Internationally Collaborated Matrix(es)
international_colmat(affiliations, pub_years = NULL, countries = NULL)

| affiliations | A text vector containing affiliation (country) information |
|---|---|
| pub_years | A numeric vector containing publication years |
| countries | A list of countries (optional) |

Finds the collaboration network at the international level as an adjacency matrix, so the affiliation fields must contain country information.
Collaboration adjacency matrix(es)
affiliations <- c("Stanford University, Stanford, CA, United States; Montreal, QC, Canada",
                  "Stanford University, United States; Google Inc., United States",
                  "University of Michigan, Ann Arbor, MI 48109-2122, United States; Tsinghua University, Beijing 100084, China",
                  "Imperial College London, London, SW7 2BZ, United Kingdom; ENSTA, Ecole Polytechnique, Palaiseau, 91761, France")
pub_years <- c(2012, 2012, 2013, 2014)
international_colmat(affiliations, pub_years)
international_colmat(affiliations)
Term Frequency
term_freq(text, pub_years = NULL, sep = NULL, top = NULL)

| text | A character vector |
|---|---|
| pub_years | A numeric vector containing publication years |
| sep | A character value which separates the terms (optional) |
| top | The number of terms to return |

Term frequency vector or matrix (for year-wise)
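
As an illustration of the two return shapes, the following base R sketch counts separator-delimited terms overall and per year. The input data is made up for the example, the variable names are hypothetical, and the package's own tokenization (especially when `sep` is not given) may differ.

```r
# Illustrative sketch only; hypothetical keyword data with ";" as separator.
keywords <- c("robotics; control", "control; haptics", "robotics; control; vision")
pub_years <- c(2012, 2012, 2013)

# Split each record into terms and strip surrounding whitespace
split_terms <- strsplit(keywords, ";", fixed = TRUE)
terms <- trimws(unlist(split_terms))

# Overall term frequency vector, most frequent first
freq <- sort(table(terms), decreasing = TRUE)

# Year-wise term frequency matrix: repeat each year once per term in its record
years <- rep(pub_years, lengths(split_terms))
freq_by_year <- table(year = years, term = terms)

print(freq)
print(freq_by_year)
```

Taking the first `top` entries of `freq` (for example with `head(freq, 2)`) would give the most frequent terms.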