数据集 开放存取

BIP4COVID19:冠状病毒相关出版物的影响指标和指标

拟南芥; 伊利亚斯·卡内洛斯(Ilias Kanellos); 塞拉菲姆·查佐普洛斯; 达娜(Danae Pla Karidi); 西奥多·达拉加加斯


JSON格式导出

{
  "files": [
    {
      "links": {
        "self": "//americinnmankato.com/api/files/b38113de-1353-4a60-b954-276fb40ea35a/articles_by_influence_alt.txt"
      }, 
      "checksum": "md5:305e4da3e216f225d00bc8f9e7da45f5", 
      "bucket": "b38113de-1353-4a60-b954-276fb40ea35a", 
      "key": "articles_by_influence_alt.txt", 
      "type": "txt", 
      "size": 20635840
    }, 
    {
      "links": {
        "self": "//americinnmankato.com/api/files/b38113de-1353-4a60-b954-276fb40ea35a/articles_by_influence.txt"
      }, 
      "checksum": "md5:25ee01e94c45aff4df45c745f9747e50", 
      "bucket": "b38113de-1353-4a60-b954-276fb40ea35a", 
      "key": "articles_by_influence.txt", 
      "type": "txt", 
      "size": 20635840
    }, 
    {
      "links": {
        "self": "//americinnmankato.com/api/files/b38113de-1353-4a60-b954-276fb40ea35a/articles_by_popularity_alt.txt"
      }, 
      "checksum": "md5:37576c62b7b7e2d00a3e2651de8d8127", 
      "bucket": "b38113de-1353-4a60-b954-276fb40ea35a", 
      "key": "articles_by_popularity_alt.txt", 
      "type": "txt", 
      "size": 20635840
    }, 
    {
      "links": {
        "self": "//americinnmankato.com/api/files/b38113de-1353-4a60-b954-276fb40ea35a/articles_by_popularity.txt"
      }, 
      "checksum": "md5:94a4394e682c48281acdab05965ef465", 
      "bucket": "b38113de-1353-4a60-b954-276fb40ea35a", 
      "key": "articles_by_popularity.txt", 
      "type": "txt", 
      "size": 20635840
    }, 
    {
      "links": {
        "self": "//americinnmankato.com/api/files/b38113de-1353-4a60-b954-276fb40ea35a/articles_by_tweets.txt"
      }, 
      "checksum": "md5:1c1c127ac88acd1dfca2e4cdf3e5caff", 
      "bucket": "b38113de-1353-4a60-b954-276fb40ea35a", 
      "key": "articles_by_tweets.txt", 
      "type": "txt", 
      "size": 20635840
    }
  ], 
  "owners": [
    42037
  ], 
  "doi": "10.5281 / zenodo.4432856", 
  "stats": {
    "version_unique_downloads": 6519.0, 
    "unique_views": 94.0, 
    "views": 94.0, 
    "version_views": 58399.0, 
    "unique_downloads": 0.0, 
    "version_unique_views": 54780.0, 
    "volume": 0.0, 
    "version_downloads": 8535.0, 
    "downloads": 0.0, 
    "version_volume": 69130124684.0
  }, 
  "links": {
    "doi": "//doi.org/10.5281/zenodo.4432856", 
    "conceptdoi": "//doi.org/10.5281/zenodo.3723281", 
    "bucket": "//americinnmankato.com/api/files/b38113de-1353-4a60-b954-276fb40ea35a", 
    "conceptbadge": "//americinnmankato.com/badge/doi/10.5281/zenodo.3723281.svg", 
    "html": "//americinnmankato.com/record/4432856", 
    "latest_html": "//americinnmankato.com/record/4432856", 
    "badge": "//americinnmankato.com/badge/doi/10.5281/zenodo.4432856.svg", 
    "latest": "//americinnmankato.com/api/records/4432856"
  }, 
  "conceptdoi": "10.5281 / zenodo.3723281", 
  "created": "2021-01-11T20:24:50.729576+00:00", 
  "updated": "2021-01-12T00:52:11.487891+00:00", 
  "conceptrecid": "3723281", 
  "revision": 2, 
  "id": 4432856, 
  "metadata": {
    "access_right_category": "success", 
    "doi": "10.5281 / zenodo.4432856", 
    "description": "<p>This dataset contains impact metrics and indicators for a set of publications that are related to the <a href=\"//en.wikipedia.org/wiki/Coronavirus_disease_2019\">COVID-19 infectious disease</a> and the 新冠病毒 that causes it. It is based on:</p>\n\n<ol>\n\t<li>&Tau;he <a href=\"//pages.semanticscholar.org/coronavirus-research\">CORD-19 dataset</a> released by the team of <a href=\"//www.semanticscholar.org/\">Semantic Scholar</a><sup>1</sup> and</li>\n\t<li>&Tau;he curated data provided by the <a href=\"//www.ncbi.nlm.nih.gov/research/coronavirus/\">LitCovid hub</a><sup>2</sup>.</li>\n</ol>\n\n<p>These data have been cleaned and integrated with data from <a href=\"//github.com/echen102/COVID-19-TweetIDs\">COVID-19-TweetIDs</a> and from other sources (e.g., PMC). The result was dataset of&nbsp;230,857 unique articles along with relevant metadata (e.g., the underlying citation network). We utilized this dataset to produce, for each article, the values of the following impact measures:</p>\n\n<ul>\n\t<li><em><strong>Influence:</strong></em> Citation-based measure reflecting the total impact of an article. This is based on the PageRank<sup>3</sup> network analysis method. In the context of citation networks, it estimates the importance of each article based on its centrality in the whole network. This measure was calculated using the PaperRanking (<a href=\"//github.com/diwis/PaperRanking\">//github.com/diwis/PaperRanking</a>) library<sup>4</sup>.</li>\n\t<li><strong><em>Influence_alt:</em></strong> Citation-based measure reflecting the total impact of an article. This is the Citation Count of each article, calculated based on the citation network between the articles contained in the BIP4COVID19 dataset.</li>\n\t<li><em><strong>Popularity:</strong></em> Citation-based measure reflecting the current impact of an article. This is based on the AttRank<sup>5</sup> citation network analysis method. Methods like PageRank are biased against recently published articles (new articles need time to receive their first citations). AttRank alleviates this problem incorporating an attention-based mechanism, akin to a time-restricted version of preferential attachment, to explicitly capture a researcher&#39;s preference to read papers which received a lot of attention recently. This is why it is more suitable to capture the current &quot;hype&quot; of an article.</li>\n\t<li><em><strong>Popularity alternative:</strong></em> An alternative citation-based measure reflecting the current impact of an article (this was the basic popularity measured provided by BIP4COVID19 until version 26). This is based on the RAM<sup>6</sup> citation network analysis method. Methods like PageRank are biased against recently published articles (new articles need time to receive their first citations). RAM alleviates this problem using an approach known as &quot;time-awareness&quot;. This is why it is more suitable to capture the current &quot;hype&quot; of an article. This measure was calculated using the PaperRanking (<a href=\"//github.com/diwis/PaperRanking\">//github.com/diwis/PaperRanking</a>) library<sup>4</sup>.</li>\n\t<li><em><strong>Social Media Attention: </strong></em>The number of tweets related to this article. Relevant data were collected from the <a href=\"//github.com/echen102/COVID-19-TweetIDs\">COVID-19-TweetIDs</a> dataset. In this version, tweets between 7/11-13/11 have been considered from the previous dataset.&nbsp;</li>\n</ul>\n\n<p>We provide five CSV files, all containing the same information, however each having its entries ordered by a different impact measure. All CSV files are tab separated and have the same columns (PubMed_id, PMC_id, 土井, influence_score, popularity_alt_score, popularity score, influence_alt score, tweets count).</p>\n\n<p>The work is based on the following publications:</p>\n\n<blockquote>\n<ol>\n\t<li>COVID-19 Open Research 数据集 (CORD-19). 2020. Version 2021-01-03 Retrieved from //pages.semanticscholar.org/coronavirus-research. Accessed 2021-01-03. doi:10.5281/zenodo.3715506</li>\n\t<li>Chen Q, Allot A, &amp; Lu Z. (2020) Keep up with the latest 新冠病毒 research, Nature 579:193 (version 2021-01-03)</li>\n\t<li>R. Motwani L. Page, S. Brin and T. Winograd. 1999. The PageRank Citation Ranking: Bringing Order to the Web. Technical Report. Stanford InfoLab.</li>\n\t<li>I. Kanellos, T. Vergoulis, D. Sacharidis, T. Dalamagas, Y. Vassiliou: Impact-Based Ranking of Scientific Publications: A Survey and Experimental Evaluation. TKDE 2019</li>\n\t<li>I. Kanellos, T. Vergoulis, D. Sacharidis, T. Dalamagas, Y. Vassiliou: Ranking Papers by their Short-Term Scientific Impact. CoRR abs/2006.00951 (2020)</li>\n\t<li>Rumi Ghosh, Tsung-Ting Kuo, Chun-Nan Hsu, Shou-De Lin, and Kristina Lerman. 2011. Time-Aware Ranking in Dynamic Citation Networks. In Data Mining Workshops (ICDMW). 373&ndash;380</li>\n</ol>\n</blockquote>\n\n<p>A Web user interface that uses these data to facilitate the 新冠肺炎 literature exploration, can be found <a href=\"//bip.covid19.athenarc.gr\">here</a>. More details in our preprint <a href=\"//www.biorxiv.org/content/10.1101/2020.04.11.037093v2\">here</a>.</p>\n\n<p>In this version, an extra score (influence_alt = Citation counts) has been included in the dataset.</p>\n\n<p><em><strong>Please cite:</strong> 拟南芥, 伊利亚斯·卡内洛斯(Ilias Kanellos), 塞拉菲姆·查佐普洛斯, 达娜(Danae Pla Karidi), 西奥多·达拉加加斯. &quot;BIP4COVID19: Releasing impact measures for articles relevant to 新冠肺炎&quot;. bioRxiv 2020.04.11.037093; doi: //doi.org/10.1101/2020.04.11.037093</em></p>\n\n<p><em><strong>Terms of use:</strong></em> These data are provided &quot;as is&quot;, without any warranties of any kind. The data are provided under the 知识共享署名4.0国际 license.</p>", 
    "license": {
      "id": "CC-BY-4.0"
    }, 
    "title": "BIP4COVID19:冠状病毒相关出版物的影响指标和指标", 
    "notes": "We acknowledge support of this work by the project \"Moving from Big Data Management to Data Science\" (MIS 5002437/3) which is implemented under the Action \"Reinforcement of the Research and Innovation 基础设施\", funded by the Operational Programme \"Competitiveness, Entrepreneurship and Innovation\" (NSRF 2014-2020) and co-financed by Greece and the European Union (European Regional Development Fund).", 
    "relations": {
      "version": [
        {
          "count": 37, 
          "index": 36, 
          "parent": {
            "pid_type": "recid", 
            "pid_value": "3723281"
          }, 
          "is_last": true, 
          "last_child": {
            "pid_type": "recid", 
            "pid_value": "4432856"
          }
        }
      ]
    }, 
    "communities": [
      {
        "id": "covid-19"
      }, 
      {
        "id": "zenodo"
      }
    ], 
    "version": "30", 
    "references": [
      "COVID-19 Open Research 数据集 (CORD-19). 2020. Version 2021-01-03. Retrieved from //pages.semanticscholar.org/coronavirus-research. Accessed 2021-01-03.", 
      "I. Kanellos, T. Vergoulis, D. Sacharidis, T. Dalamagas, Y. Vassiliou: Impact-Based Ranking of Scientific Publications: A Survey and Experimental Evaluation. TKDE 2019", 
      "I. Kanellos, T. Vergoulis, D. Sacharidis, T. Dalamagas, Y. Vassiliou: Ranking Papers by their Short-Term Scientific Impact. CoRR abs/2006.00951 (2020)", 
      "Rumi Ghosh, Tsung-Ting Kuo, Chun-Nan Hsu, Shou-De Lin, and Kristina Lerman. 2011. Time-Aware Ranking in Dynamic Citation Networks. In Data Mining Workshops (ICDMW). 373\u2013380", 
      "R. Motwani L. Page, S. Brin and T. Winograd. 1999. The PageRank Citation Ranking: Bringing Order to the Web. Technical Report. Stanford InfoLab.", 
      "Chen Q, Allot A, & Lu Z. (2020) Keep up with the latest 新冠病毒 research, Nature 579:193 (version 2021-01-03)"
    ], 
    "keywords": [
      "COVID-19", 
      "coronavirus", 
      "scientometrics", 
      "bibliometrics"
    ], 
    "publication_date": "2021-01-11", 
    "creators": [
      {
        "orcid": "0000-0003-0555-4128", 
        "affiliation": "雅典娜研究中心", 
        "name": "Thanasis Vergoulis"
      }, 
      {
        "orcid": "0000-0003-2146-3795", 
        "affiliation": "雅典娜研究中心", 
        "name": "Ilias Kanellos"
      }, 
      {
        "orcid": "0000-0003-1714-5225", 
        "affiliation": "雅典娜研究中心", 
        "name": "塞拉菲姆·查佐普洛斯"
      }, 
      {
        "orcid": "0000-0002-3154-6212", 
        "affiliation": "雅典娜研究中心", 
        "name": "Danae Pla Karidi"
      }, 
      {
        "orcid": "0000-0002-5002-7901", 
        "affiliation": "雅典娜研究中心", 
        "name": "Theodore Dalamagas"
      }
    ], 
    "access_right": "open", 
    "resource_type": {
      "type": "dataset", 
      "title": "Dataset"
    }, 
    "related_identifiers": [
      {
        "scheme": "url", 
        "identifier": "//pages.semanticscholar.org/coronavirus-research", 
        "relation": "cites", 
        "resource_type": "dataset"
      }, 
      {
        "scheme": "handle", 
        "identifier": "www.biorxiv.org/content/10.1101/2020.04.11.037093v2", 
        "relation": "isSupplementTo", 
        "resource_type": "publication-preprint"
      }, 
      {
        "scheme": "url", 
        "identifier": "//github.com/diwis/PaperRanking", 
        "relation": "cites", 
        "resource_type": "software"
      }, 
      {
        "scheme": "doi", 
        "identifier": "10.5281 / zenodo.3723281", 
        "relation": "isVersionOf"
      }
    ]
  }
}
58,399
8,535
意见
资料下载
所有版本 这个版本
观看次数 58,39994
资料下载 8,5350
数据量 69.1 GB0字节
独特的景色 54,78094
独特下载 6,5190

分享

引用为