Skip to content

Latest commit

 

History

History
104 lines (74 loc) · 2.66 KB

tutorial.md

File metadata and controls

104 lines (74 loc) · 2.66 KB
title author date output
hIndexOverYears tutorial
Jose V. Die
February 18, 2016
html_document
keep_md
true

This is a markdown tutorial for the hIndexOverYears package. It is based on the overview of citations from a given author. Data has to be obtained from Scoups Database.

Corrections

Improvements and corrections to this document can be submitted on its GitHub in its repository.

Data set

  • Get the list of documents written by a given author and click on View citation overview.

  • Set Data range. Starting year corresponds to the beginnning publication record, so that author has h index = 0. Then, update the system.

  • Export the citation overview to a spreedsheet


Install the hIndexOverYears from Github

Step1. You need to install the devtools package.

install.packages("devtools")

Step2. Load the devtools package.

library(devtools)

Step3. Install the hIndexOverYears package.

install_github("jdieramon/hIndex")

Usage

Load the package

library(hIndexOverYears)

Let´s take a look at the documentation of the package.

library(help=hIndexOverYears)  

You can see the code for the functions:

?h.plot
?get1cite

Tidy Data

Before we start the analysis, we want to make the data tidy. The good thing is that Scopus keeps the same format for every citation overview, so data cleaning can be performed in one easy step. The function clean will read and clean the data for you.

Load and clean data

dat <- clean("CTOExport.csv")

Now, the dataset is ready for further analysis.

Plot the h Index

You can also use the h.plot function on the tidy data to show the h Index evolution over years. If the starting year does not correspond with h=0, you can enter the h value as an argument:

h.plot(dat, 2007, 2018, 0)


You may also want to use the `get1cite`function to list your most highly cited papers (top10) and get a sense of how long it takes to get then 1 citation. The function shows by default your top10 cited papers, but you can give the number of papers as an argument. It shows the average time (in months) per 1 cite.
get1cite(dat)


Session information

sessionInfo()