This article provides a complete guide for researchers and professionals on using VOSviewer for bibliometric analysis in environmental science.
This article provides a complete guide for researchers and professionals on using VOSviewer for bibliometric analysis in environmental science. It covers foundational principles, from defining bibliometric networks to exploring environmental research landscapes. The guide details practical methodologies for constructing and visualizing networks based on citation, co-authorship, and keyword co-occurrence, using real-world examples from fields like resilient cities and microplastic pollution. It also addresses common troubleshooting and optimization techniques for data handling from sources like Scopus and Web of Science, and validates findings through comparative analysis with other tools. Ultimately, this resource empowers scientists to leverage VOSviewer for uncovering research trends, collaboration patterns, and emerging topics in environmental studies, enhancing the quality and impact of their literature reviews and research planning.
VOSviewer is a specialized software tool for constructing, visualizing, and exploring bibliometric networks. Developed by Nees Jan van Eck and Ludo Waltman at the Centre for Science and Technology Studies (CWTS) of Leiden University, its name stands for "Visualization of Similarities" [1] [2]. It is designed to create maps based on network data where the distance between items reflects their relatedness, providing a powerful means to analyze the structure of scientific literature [3].
While its core function is analyzing bibliometric data from sources like Web of Science, Scopus, and PubMed, VOSviewer is also a capable tool for text analysis, enabling the creation of co-occurrence maps from any body of text, such as academic abstracts or interview transcripts [1] [2].
In environmental science, VOSviewer helps researchers map the complex landscape of scientific knowledge. For instance, a bibliometric analysis of microplastic (MP) pollution research can use VOSviewer to visualize international collaborations and identify core research themes, such as distribution, toxic effects, and analytical methods [4]. The software can process thousands of publications to reveal trends and patterns that might otherwise remain hidden in large datasets.
Table 1: Quantitative Analysis of Microplastics Research via VOSviewer
| Aspect of Analysis | Key Finding | Data Source & Scope |
|---|---|---|
| Publication Volume | Explosive growth from 2004; 3,548 publications in 2022 alone (30.12% of total analyzed). | Web of Science (2004-2023); 11,777 English literature pieces [4]. |
| Global Research Participation | 147 countries participated, with China, the United States, the UK, Australia, and Canada being the most prolific. | Web of Science [4]. |
| Research Hotspots (Clusters) | 1. Distribution and sources of MPs2. Exposure and toxic effects3. Research methods for MPs4. Adsorption of MPs with other pollutants. | Keyword co-occurrence analysis performed with VOSviewer [4]. |
This section provides detailed methodologies for conducting a bibliometric analysis using VOSviewer, framed within the context of environmental science research.
A co-authorship network map reveals collaborative relationships between researchers, institutions, or countries [3] [1].
Workflow Overview
Step-by-Step Instructions
This analysis extracts key terms from publication titles and abstracts to identify predominant research topics and conceptual themes within a field [3] [2].
Workflow Overview
Step-by-Step Instructions
Table 2: Essential Tools for a VOSviewer-Based Bibliometric Study
| Item / Software | Function in the Workflow |
|---|---|
| Web of Science / Scopus | Primary data sources. These subscription-based databases are the gold standard for exporting high-quality bibliographic records for analysis [3] [4]. |
| VOSviewer Software | The core analysis and visualization engine. It is a free, Java-based application that constructs and visualizes the bibliometric networks [2]. |
| Microsoft Excel | Data cleaning and manipulation tool. Used for initial data organization and, if necessary, for reformatting non-bibliometric data (e.g., interview transcripts) into a structure compatible with VOSviewer's text analysis function [2]. |
| Lexos | A web-based text cleaning and tokenization tool. Useful for preparing large bodies of unstructured text (e.g., social media transcripts) by dividing them into standardized chunks ("tokens") for analysis in VOSviewer [2]. |
Within the domain of bibliometric analysis, understanding the intellectual structure and collaborative dynamics of a scientific field is paramount. VOSviewer, a specialized software tool, enables this exploration through the construction and visualization of several core types of bibliometric networks [6] [7]. These networks—citation, bibliographic coupling, co-authorship, and co-occurrence—provide unique lenses to investigate relationships between scholarly publications, authors, journals, and keywords [8]. In environmental science research, where the field is inherently interdisciplinary and rapidly evolving, these analyses are invaluable for mapping research trends, identifying key players, uncovering emerging topics, and tracing the flow of ideas [9] [10]. This document provides detailed application notes and experimental protocols for employing these core network types within the VOSviewer environment, with a specific focus on applications in environmental science.
VOSviewer supports the creation of multiple network types based on data from major bibliographic databases like Web of Science, Scopus, Dimensions, and Lens [8]. Each network type defines a unique relationship between the units of analysis (e.g., publications, authors, journals, or terms).
Table 1: Core Bibliometric Network Types in VOSviewer
| Network Type | Relationship Defined | Units of Analysis | Primary Use Case in Environmental Science |
|---|---|---|---|
| Citation | A publication cites another publication. | Publications, Journals | Tracing the influence and foundational literature of a field (e.g., seminal papers on climate change) [7]. |
| Bibliographic Coupling | Two publications reference a common third publication. | Publications, Journals | Mapping current research fronts and identifying groups of papers working on similar contemporary issues [7]. |
| Co-authorship | Two researchers, institutions, or countries co-author a publication. | Researchers, Institutions, Countries | Revealing collaborative partnerships and social networks within and across environmental research communities [10] [7]. |
| Co-occurrence | Two terms (e.g., keywords) appear together in the same publication. | Keywords, Terms from Titles/Abstracts | Identifying research hotspots, conceptual themes, and the intellectual structure of a field (e.g., linking "restorative environment" and "mental health") [10] [7]. |
Diagram 1: Logical relationships defining the four core network types.
Conducting a robust bibliometric analysis requires a set of "research reagents"—specialized tools and data sources. The table below details the essential components for a VOSviewer-based study.
Table 2: Key Research Reagents for VOSviewer Bibliometric Analysis
| Tool / Resource | Type | Primary Function | Relevance to Environmental Science |
|---|---|---|---|
| VOSviewer Software | Analysis & Visualization Tool | Constructs and visualizes bibliometric networks; performs layout, clustering, and mapping [6] [7]. | Core platform for mapping the intellectual structure of environmental research domains [9] [10]. |
| Bibliometrix (R-package) | Complementary Analysis Tool | A comprehensive R-tool for science mapping analysis; allows greater customization and analysis from multiple data sources [7]. | Useful for performing complementary statistical analyses and processing data from diverse databases concurrently [9]. |
| Web of Science / Scopus | Bibliographic Database | Primary sources for exporting high-quality metadata of scientific publications [10] [7]. | Provides comprehensive coverage of high-impact environmental science journals for data extraction [9] [10]. |
| Thesaurus File | Data Cleaning Tool | A text file used to merge synonymous terms (e.g., "climate change" and "global warming") [7]. | Crucial for normalizing diverse environmental terminology to ensure accurate keyword co-occurrence maps [9]. |
Application: To identify conceptual themes, research hotspots, and the intellectual structure in environmental science (e.g., "socio-environmental disclosure" or "restorative environments") [9] [10].
Diagram 2: Workflow for creating a keyword co-occurrence network.
Application: To reveal collaboration patterns among researchers, institutions, or countries within environmental science, such as international partnerships in climate change research [10].
Application: Citation analysis traces historical influence and foundational knowledge (what has been influential). Bibliographic coupling maps current research fronts and groups of papers actively working on similar problems (what is happening now) [7].
VOSviewer provides multiple visualization modes, each suited for a different type of analysis [8] [7]:
VOSviewer also incorporates natural language processing techniques to extract relevant terms from titles and abstracts for building co-occurrence networks and allows for the use of a thesaurus file to consolidate synonymous terms, which is critical for clean and accurate maps [8] [7].
The field of environmental science is experiencing a significant transformation, driven by the increasing urgency of global sustainability challenges and the data-driven capabilities of bibliometric analysis. This growth is particularly visible in research addressing the United Nations' Sustainable Development Goals (SDGs), where the volume of literature has expanded dramatically. Bibliometrics provides a powerful, quantitative framework for mapping the evolution of scientific knowledge, identifying emerging trends, and understanding the collaborative networks that underpin environmental research. Originating from information and library science, bibliometric methods have evolved, combining quantitative statistics with network analysis and data visualization techniques to create insightful maps of scientific literature [12].
The application of these methods within environmental science allows researchers and policymakers to digest large, complex corpora of scientific publications. For instance, a comprehensive bibliometric analysis of Sustainable Inclusive Economic Growth (SIEG) within the framework of SDG 8 recorded a substantial increase in research output, with a notable surge after 2019 as global efforts toward the UN 2030 Agenda intensified [11]. This analysis, conducted using specialized software, helped identify the most productive countries, influential authors, and leading journals, demonstrating the practical value of bibliometrics in tracking a rapidly evolving research domain [11]. This protocol details the methods for applying bibliometric analysis, specifically using the VOSviewer software, to track and visualize this explosive growth in environmental science.
Table 1: Essential Research Reagents for Bibliometric Analysis
| Item Name | Function/Brief Explanation |
|---|---|
| Bibliographic Database | A structured source of publication data (e.g., Scopus, Dimensions). Provides the raw metadata (authors, titles, citations, etc.) for analysis. |
| VOSviewer Software | A specialized tool for constructing and visualizing bibliometric networks. It performs the core analysis based on co-authorship, citation, co-citation, or keyword co-occurrence [12]. |
| Data Export File (.CSV) | A formatted file containing the exported records from the bibliographic database, structured for compatibility with VOSviewer [13]. |
| Analysis Threshold | A minimum frequency value (e.g., 5, 10, 20) applied to select the most relevant authors, keywords, or other units for analysis, ensuring a manageable and meaningful network [12]. |
| Color Schemes (e.g., Viridis) | Perceptually uniform color palettes used in VOSviewer for overlay and density visualizations. They improve interpretability over older schemes like the rainbow colormap [5]. |
This section outlines the step-by-step procedure for gathering and preparing publication data for a bibliometric analysis of environmental science topics, such as trends related to SDG 8.
( "sustainable inclusive economic growth" OR "SIEG" OR "SDG 8" ) AND PUBLICATION YEAR > 2014.This protocol details the process of creating and interpreting bibliometric maps using VOSviewer software, from data import to figure generation.
Launch and Import Data:
Select Analysis Type and Knowledge Unit:
Set the Analysis Threshold:
Generate and Refine the Map:
Customize Visuals and Export:
tab20 color scheme to clearly distinguish between different research fronts [5].The following diagram illustrates the logical workflow for a complete bibliometric analysis, from initial data collection to final interpretation.
Upon successful execution of the protocols, researchers can expect to generate several key quantitative and visual outputs that characterize the research landscape.
Table 2: Expected Core outputs from a Bibliometric Analysis
| Output Metric | Description | Example from SIEG/SDG 8 Analysis [11] |
|---|---|---|
| Productive Countries | Countries with the highest volume of publications on the topic. | China, India, and Italy emerged as the most productive. |
| Leading Journals | Journals publishing the most research in the domain. | Sustainability (Switzerland), published by MDPI, was the leading journal. |
| Influential Authors | Most cited researchers, indicating high-impact work. | Bekun FV, Onifade ST (Turkey), and Zhang X (China) were highly cited. |
| Collaboration Clusters | Groups of frequently collaborating countries/institutions. | Six co-authorship clusters were identified, with India leading one cluster (63 publications). |
| Dominant & Emerging Themes | Key topics and their evolution over time, identified via keyword analysis. | A shift from "financial inclusion & CSR" to "digital economy, blue economy, employment & entrepreneurship" was observed. |
The following diagram represents a hypothetical output of a keyword co-occurrence analysis generated in VOSviewer, simulating the structure and clusters one might find in a growing field like environmental science bibliometrics.
The analysis of a field like environmental science bibliometrics itself reveals dynamic patterns. The observed substantial increase in research output post-2019 in SIEG/SDG 8 research is a microcosm of the broader "explosive growth" in environmental science [11]. This surge is likely linked to intensified global efforts toward the 2030 Agenda for Sustainable Development.
Bibliometric mapping allows researchers to move beyond simple publication counts to understand the intellectual structure of a field. The identification of distinct thematic clusters (e.g., core concepts, social equity, environmental focus) and their interconnections, as visualized in the network diagram, helps pinpoint specialized research fronts and potential interdisciplinary bridges. Furthermore, the thematic evolution from traditional topics like financial inclusion toward cutting-edge areas like the digital and blue economy provides critical insights for scientists and funders seeking to anticipate future research directions [11].
The choice of visualization tools and settings directly impacts the clarity and accuracy of these insights. The adoption of perceptually uniform color schemes like viridis in VOSviewer, for example, prevents the misinterpretation of data that can occur with the previously used rainbow color scheme, ensuring that trends related to the average publication year or citation impact are correctly perceived [5].
Within the field of environmental science research, the ability to systematically map the intellectual landscape is crucial for identifying emerging trends, collaborative networks, and foundational knowledge. Bibliometric analysis has emerged as a powerful methodology for this purpose, originating from information and library science and combining quantitative methods with network analysis and data visualization [12]. The VOSviewer software, a tool whose name is short for "Visualization of Similarities," is specifically designed for creating, visualizing, and exploring bibliometric maps based on network data [14] [12]. Developed by van Eck and Waltman from Leiden University, it has become an indispensable instrument in the scientist's toolkit, enabling researchers to transform complex publication data into interpretable visual networks that reveal the underlying structure of scientific fields [12]. This application note details the protocols for employing VOSviewer to map research trends, identify key authors, and unveil thematic clusters, with specific examples framed within environmental science research.
VOSviewer supports several core types of analysis, each designed to answer different research questions. The general workflow begins with data extraction from bibliographic databases like Scopus or Web of Science, followed by data preprocessing to ensure compatibility, and culminates in network construction and visualization within VOSviewer [11] [15]. The table below summarizes the key applications, their objectives, and the required data.
Table 1: Core Bibliometric Analysis Applications in VOSviewer
| Application | Primary Research Objective | Type of Data Required | Visualization Output |
|---|---|---|---|
| Co-authorship Analysis | Map collaboration patterns between individuals, institutions, or countries. | Author names, institutional affiliations, countries. | Network where nodes represent authors/ institutions/countries; links represent joint publications. |
| Co-occurrence Analysis | Identify thematic clusters and conceptual structure of a field. | Keywords (author-generated or database-tagged) or terms from titles and abstracts. | Network where nodes represent keywords/terms; links represent frequency of co-occurrence. |
| Citation-Based Analysis | Identify influential works, authors, and journals, and map intellectual foundations. | Document citation data, author citation data, or journal citation data. | Network where nodes represent publications/authors/journals; links represent citation relationships. |
| Bibliographic Coupling | Map relationships between documents that cite the same references, revealing topical similarity. | Reference lists of citing documents. | Network where nodes are documents; link strength depends on number of shared references. |
Objective: To identify and visualize the main thematic areas and their evolution within a research domain, such as sustainable inclusive economic growth or machine learning in environmental chemical research [11] [15].
Methodology:
Data Collection and Preparation:
Network Creation and Threshold Setting:
Visualization and Interpretation:
viridis scheme) to more recent (warmer colors like yellow) [5].The following diagram illustrates the logical workflow for this co-occurrence analysis:
Objective: To identify the most influential researchers and patterns of scientific collaboration within a specific field using co-authorship and citation analysis.
Methodology:
Data Collection: Follow a similar data collection and preparation procedure as in Protocol 1, ensuring that author names and affiliations are included in the exported data.
Network Creation:
Visualization and Interpretation:
Table 2: Quantitative Data from a Bibliometric Analysis of SDG 8 Research (2015-2025)
| Metric | Findings | Implications |
|---|---|---|
| Most Productive Countries | China, India, Italy | Indicates geographic centers of research productivity and potential policy focus. |
| Leading Journal | Sustainability (Switzerland) | Identifies the primary academic outlet for this research field. |
| Highly Cited Researchers | Bekun FV, Onifade ST (Turkey); Zhang X (China) | Highlights influential authors and thought leaders. |
| Number of Country Clusters | 6 distinct collaboration clusters | Reveals the structure of international research collaboration. |
| Thematic Evolution | Shift from financial inclusion/CSR (2014-2023) to digital economy, blue economy (2024-2025) | Tracks the progression of research foci over time, showing emerging trends. |
In the context of bibliometric analysis with VOSviewer, "research reagents" refer to the essential software, data, and methodological components required to conduct a successful analysis.
Table 3: Essential Research Reagents for VOSviewer Analysis
| Reagent / Tool | Function / Description | Application Note |
|---|---|---|
| Bibliographic Database | Source of raw publication data (e.g., Scopus, Web of Science). | Provides structured data including citations, abstracts, and author affiliations for export and analysis. |
| VOSviewer Software | Core tool for constructing, visualizing, and exploring bibliometric maps. | Used for all network types (co-authorship, co-occurrence, citation). The latest versions use perceptually uniform color schemes like viridis by default [5]. |
| Data Preprocessing Scripts | Code (e.g., in Python, R) for cleaning and standardizing raw data. | Ensures data consistency (e.g., standardizing author name variants) before import, improving map accuracy. |
| VOSviewer Color Schemes | Define the palette for overlay and density visualizations. | The viridis scheme is perceptually uniform, aiding accurate interpretation. Coolwarm is a diverging scheme useful for highlighting above/below average values [5]. |
| Clustering Algorithm | The smart local moving algorithm used to identify thematic clusters. | Automatically groups related items (e.g., authors, keywords) within the network, defining the map's structure [14]. |
A full bibliometric study often integrates multiple analysis types. The following diagram outlines the comprehensive end-to-end workflow, from data acquisition to final interpretation, connecting the various protocols and reagents.
This application note provides a comprehensive technical overview of VOSviewer, a specialized software tool for constructing and visualizing bibliometric networks. Aimed at researchers, scientists, and drug development professionals, we document detailed protocols for leveraging VOSviewer's workflow within environmental science research contexts. The guidance covers data acquisition from major bibliographic databases, network construction techniques, visualization customization, and interpretation of analytical outputs. Emphasis is placed on practical methodology with structured data presentation and visual workflow documentation to facilitate immediate implementation by users conducting bibliometric analysis on scientific landscapes, particularly in tracking research trends on environmental degradation.
VOSviewer is a specialized software tool designed for constructing and visualizing bibliometric networks. These networks can represent various scholarly entities including journals, researchers, or individual publications, and are constructed based on citation analysis, bibliographic coupling, co-citation, or co-authorship relations. The software also provides text mining functionality to construct and visualize co-occurrence networks of important terms extracted from scientific literature [6]. Developed by the Centre for Science and Technology Studies (CWTS) in Leiden, VOSviewer has become an essential tool for researchers conducting systematic reviews and science mapping studies.
For environmental science researchers, VOSviewer offers powerful capabilities to map the rapidly evolving landscape of sustainability research. The tool enables the identification of key trends, influential authors, and emerging topics in fields such as environmental degradation, where publication growth exceeds 80% annually [16]. This application note provides a structured framework for utilizing VOSviewer within research contexts, with specific examples drawn from environmental bibliometrics.
VOSviewer undergoes regular updates to enhance functionality and data compatibility. The following table summarizes recent version improvements relevant to research applications:
Table 1: VOSviewer Version History and Feature Development
| Version | Release Date | Key Features and Improvements |
|---|---|---|
| 1.6.20 | October 31, 2023 | Improved map creation from API data; enhanced support for Scopus' new export format [6] |
| 1.6.19 | January 23, 2023 | Improved OpenAlex data support; fixed Web of Science data processing issues [6] |
| 1.6.18 | January 24, 2022 | Added OpenAlex data source; Europe PMC full-text search; Semantic Scholar organization maps; Crossref ROR ID querying [6] |
| 1.6.17 | July 22, 2021 | Online map sharing via VOSviewer Online; Lens export support; JSON file format [6] |
VOSviewer supports multiple data sources, which is crucial for comprehensive bibliometric analysis. The software can process data from:
The flexibility in data sourcing enables researchers to construct analyses from multiple literature corpora, enhancing the robustness of bibliometric findings.
This protocol outlines the methodology for analyzing research determinants of carbon emissions and environmental degradation, adapting the approach used in a recent bibliometric study [16].
Step 1: Database Selection and Search Strategy
Step 2: Data Export
Step 3: Data Import into VOSviewer
Step 4: Network Type Selection
Step 5: Counting Method and Thresholds
Step 6: Network Visualization and Refinement
The following flowchart represents the primary workflow for conducting bibliometric analysis in VOSviewer:
Bibliometric Analysis Workflow in VOSviewer
Table 2: VOSviewer Network Types and Research Applications in Environmental Science
| Network Type | Analysis Level | Research Application | Environmental Science Example |
|---|---|---|---|
| Co-authorship | Authors, Organizations, Countries | Identify collaboration patterns | Mapping international collaborations in climate change research [16] |
| Citation | Documents, Sources, Authors | Assess influence and impact | Identifying seminal papers on environmental Kuznets curve [16] |
| Bibliographic coupling | Documents, Sources, Authors | Group conceptually similar items | Clustering research on renewable energy and economic growth [16] |
| Co-citation | References, Sources, Authors | Map intellectual structure | Analyzing theoretical foundations of carbon emission studies [16] |
| Term co-occurrence | Keywords, Terms from text | Identify conceptual themes | Tracking evolution of research themes in environmental degradation [16] |
VOSviewer offers multiple color schemes optimized for different analytical purposes. The software transitioned from the rainbow color scheme to more perceptually uniform alternatives in version 1.6.7 [5]. The following diagram illustrates the color scheme selection process:
Color Scheme Selection Guide for VOSviewer Visualizations
Table 3: Essential Research Materials for Bibliometric Analysis with VOSviewer
| Research Reagent | Function in Analysis | Implementation Example |
|---|---|---|
| Scopus Database | Primary bibliographic data source for comprehensive coverage | Exporting 1365 documents on environmental degradation determinants [16] |
| VOSviewer Software | Network construction and visualization tool | Creating co-occurrence maps of keywords in environmental research [6] [16] |
| OpenAlex Data | Open bibliographic data alternative | Creating maps based on open data sources following Microsoft Academic discontinuation [6] |
| VOSviewer Online | Web-based visualization sharing platform | Embedding interactive bibliometric maps in online research platforms [6] |
| CitNetExplorer | Complementary citation network analysis tool | Analyzing citation networks of publications on carbon emissions [6] |
| Thematic Analysis Framework | Qualitative interpretation of visual patterns | Identifying key research themes like economic growth and renewable energy in environmental degradation [16] |
Implementing the protocol in section 3 for environmental degradation research yielded significant bibliometric insights [16]:
Key Findings from Environmental Degradation Analysis:
Visualization Parameters:
Color Accessibility in Scientific Visualizations: VOSviewer's transition from rainbow color schemes to perceptually uniform alternatives like viridis addresses several limitations [5]:
Data Source Migration: With the discontinuation of Microsoft Academic, VOSviewer has implemented support for alternative sources including OpenAlex, Europe PMC, and Semantic Scholar [6]. Researchers should verify current data source compatibility when designing bibliometric studies.
VOSviewer provides a robust methodological framework for conducting bibliometric analysis in environmental science research. The structured workflow encompassing data collection, network construction, visualization, and interpretation enables researchers to systematically map scientific landscapes and identify emerging trends. The application of these protocols to environmental degradation research demonstrates the practical utility of VOSviewer in tracking evolution of research themes, collaboration patterns, and knowledge structures. As bibliometric methodology continues to evolve, VOSviewer's ongoing development ensures compatibility with emerging data sources and visualization best practices, maintaining its position as an essential tool for research assessment and science mapping.
Bibliometric analysis has become an indispensable methodology in environmental science research, enabling researchers to map knowledge domains, identify emerging trends, and visualize collaborative networks. Within this context, VOSviewer has emerged as a powerful tool for constructing and visualizing bibliometric networks, supporting multiple analysis types including co-authorship, co-occurrence, citation, and bibliographic coupling. However, the quality and compatibility of input data fundamentally determine the success of these analyses. This protocol provides comprehensive guidance for exporting bibliographic data from three major sources—Scopus, Web of Science, and OpenAlex—with specific consideration for environmental science applications and VOSviewer compatibility.
The data acquisition process presents significant challenges, including platform-specific export limitations, format inconsistencies, and metadata completeness variations. Environmental science researchers particularly benefit from comprehensive data acquisition strategies due to the interdisciplinary nature of their field, which spans ecological systems, environmental chemistry, climate science, and sustainability studies. Proper data export and harmonization ensure that subsequent VOSviewer visualizations accurately represent the complex intellectual structure of environmental science research domains.
Table 1: Comparison of Export Capabilities from Major Bibliometric Data Sources
| Platform | Export Formats | Record Limits | VOSviewer Compatibility | Key Metadata Fields |
|---|---|---|---|---|
| Scopus | CSV, RIS | 2,000 per export [17] | High (CSV recommended for most analysis types) [17] | Citation information, affiliations, abstracts, keywords, references [17] |
| Web of Science | Plain Text (.txt), RIS | 500-1,000 per export [17] | High (Plain text recommended for most analysis types) [17] | Full record, cited references, author, title, source, abstract [17] |
| OpenAlex | CSV, RIS, Text (.txt) | 100,000 per export [18] | Moderate (may require format adjustment) | Flattened work data, reference managers compatibility [18] |
Table 2: Analysis Type Compatibility with VOSviewer by Export Format
| Analysis Type | Scopus CSV | WoS Plain Text | OpenAlex CSV | RIS Formats |
|---|---|---|---|---|
| Co-authorship | Supported [17] | Supported [17] | Limited | Limited [17] |
| Co-occurrence | Supported [17] | Supported [17] | Limited | Supported [17] |
| Citation | Supported [17] | Supported [17] | Limited | Limited |
| Co-citation | Supported [17] | Supported [17] | Limited | Limited |
| Bibliographic Coupling | Supported [17] | Supported [17] | Limited | Limited |
Scopus provides extensive coverage of environmental science literature, particularly strong in pollution research, ecological engineering, and environmental chemistry. The following protocol optimizes data extraction for VOSviewer analysis:
Search Strategy Formulation:
Export Configuration:
VOSviewer Preprocessing:
Scopus Data Export Workflow: Sequential steps for exporting bibliographic data from Scopus and preparing it for VOSviewer analysis.
Web of Science offers robust coverage of environmental science journals, particularly in fundamental ecology, environmental biology, and conservation science. This protocol maximizes data utility while working within platform constraints:
Search Optimization:
Export Execution:
Data Validation:
WoS Data Export Workflow: Step-by-step process for exporting data from Web of Science in VOSviewer-compatible format.
OpenAlex provides an open alternative to traditional bibliometric databases with significantly higher export limits, particularly valuable for comprehensive environmental science mapping studies:
Search Implementation:
Export Configuration:
Large-Scale Export Management:
OpenAlex Data Export Workflow: Procedure for exporting large datasets from OpenAlex while maintaining data quality and compatibility.
Bibliometric analyses in environmental science often require integrating data from multiple sources to overcome individual database limitations. BibexPy provides a Python-based solution for harmonizing datasets from Scopus and Web of Science, addressing common challenges in bibliometric data integration:
Environment Setup:
Data Integration Process:
VOSviewer Preparation:
Data Harmonization Workflow: Integration and enhancement process for multi-source bibliometric data using BibexPy.
Table 3: Essential Tools for Bibliometric Data Acquisition and Processing
| Tool/Platform | Function | Environmental Science Application |
|---|---|---|
| Scopus CSV Export | Exports comprehensive bibliographic data with references [17] | Enables co-citation analysis of environmental policy research and collaborative networks in climate science |
| Web of Science Plain Text Export | Provides full records with cited references in compatible format [17] | Supports historical analysis of ecological research trends and emerging topics in conservation biology |
| OpenAlex CSV Export | Offers open bibliometric data with high export limits [18] | Facilitates large-scale mapping of sustainability science and renewable energy research |
| BibexPy | Harmonizes datasets from multiple sources, performs deduplication, enriches metadata [19] | Enables comprehensive analysis of interdisciplinary environmental research spanning multiple domains |
| Reference Managers (Zotero/Endnote) | Imports and manages RIS format exports [18] | Organizes literature for systematic reviews in environmental health and risk assessment |
| Unpaywall API | Provides open access status and metadata enhancement [19] | Identifies publicly accessible environmental science literature for comprehensive analysis |
| Semantic Scholar API | Enhances metadata with citation context and research entities [19] | Enriches data for machine learning applications in environmental informatics |
Effective data acquisition from Scopus, Web of Science, and OpenAlex forms the critical foundation for robust bibliometric analysis in environmental science research using VOSviewer. Each platform offers distinct advantages and limitations in terms of export capabilities, record limits, and metadata completeness. Scopus provides the most versatile CSV exports for multiple analysis types but with intermediate record limits. Web of Science offers robust plain text exports compatible with VOSviewer but with more restrictive record constraints. OpenAlex presents an attractive open alternative with significantly higher export limits, though with potential format adjustment requirements for optimal VOSviewer compatibility.
For comprehensive environmental science mapping studies, researchers should consider a strategic approach combining data from multiple sources where possible, utilizing harmonization tools like BibexPy to address challenges of duplicate records, missing metadata, and inconsistent formats [19]. This integrated approach ensures that subsequent VOSviewer visualizations and analyses accurately capture the complex, interdisciplinary nature of environmental science research domains, from climate change studies to ecological conservation and environmental technology development.
VOSviewer is a powerful software tool for constructing and visualizing bibliometric networks, enabling researchers to reveal and explore the underlying structure of complex scientific fields such as environmental science. These networks can represent various relationships, including citations, co-authorships, bibliographic couplings, and co-occurrences of key terms extracted from scientific literature. For environmental scientists and drug development professionals, this capability provides a robust methodological framework for mapping the intellectual landscape of critical research areas—from climate change mitigation technologies to the environmental impact of pharmaceuticals. By transforming large, unstructured bibliographic datasets into interpretable visual maps, VOSviewer facilitates the identification of emerging trends, influential authors and institutions, and collaborative patterns within and across research domains [20] [21].
The construction of a scientifically valid and insightful network requires careful planning and execution across several stages: data collection and preparation, network parameter configuration, visualization, and interpretation. This guide provides a detailed, step-by-step protocol for building your first bibliometric network within the context of environmental science research. The methodology outlined here emphasizes practical application, ensuring that researchers can systematically create maps that accurately represent the structural and dynamic aspects of their field of investigation, thereby supporting both literature review processes and strategic research planning [1].
Successful network construction in VOSviewer requires specific "research reagents" – primarily datasets and software components. The table below details the essential materials and their functions in the network construction process.
Table 1: Essential Research Reagents for VOSviewer Network Construction
| Item Name | Type/Format | Primary Function in Network Construction |
|---|---|---|
| Bibliographic Database File | .txt, .ris (from Web of Science, Scopus, PubMed) |
Serves as the raw data input for constructing co-authorship, citation, or co-occurrence networks based on established scholarly records [1]. |
| Plain Text Corpus | .txt (structured with paragraphs as units) |
Acts as the input for constructing term co-occurrence networks through text mining, where each paragraph is treated as a context window [1]. |
| VOSviewer Desktop Application | Executable software (Windows, macOS, Linux) | The primary environment for data import, network parameter configuration, layout calculation, and visualization of the constructed network [21]. |
| Thesaurus File | .txt (simple two-column format) |
Used to merge synonymous terms or author name variants (e.g., "WHO" and "World Health Organization") to ensure conceptual consistency in the network [1]. |
| VOSviewer Online | Web-based platform | Enables the sharing and embedding of interactive network visualizations in web pages, facilitating collaboration and dissemination of findings [20]. |
| Pre-existing Network File | .gml, .pajek |
Allows for the import and visualization of networks constructed in other software tools (e.g., Gephi), leveraging VOSviewer's visualization capabilities [1]. |
This protocol is designed to map collaborative relationships between researchers, institutions, or countries within a specific environmental science subfield.
Step-by-Step Methodology:
"pharmaceutical pollution" AND "aquatic ecosystems"). Ensure the export format is compatible with VOSviewer (e.g., RIS or plain text) [1].This protocol generates a conceptual map of a research field by analyzing the co-occurrence of keywords or terms within a corpus of text, such as article titles and abstracts.
Step-by-Step Methodology:
environmental_abstracts.txt) where each paragraph represents a distinct textual unit, typically the abstract of a single research article [1].The quantitative parameters set during network construction profoundly impact the resulting map's scope and interpretability. The following table summarizes key thresholds and their typical values for environmental science applications.
Table 2: Key Quantitative Parameters for Network Construction in Environmental Science
| Parameter | Protocol 1: Co-authorship | Protocol 2: Term Co-occurrence | Impact on Final Network |
|---|---|---|---|
| Minimum Document Count | 5 | N/A | Determines the minimum productivity for an author/organization/country to be included, controlling node count [1]. |
| Minimum Term Frequency | N/A | 10 | Filters out infrequent terms, focusing the map on central, recurring concepts in the literature [1]. |
| Minimum Link Strength | 1 (or context-dependent) | 1 (or context-dependent) | Removes weak connections, simplifying the network to reveal the strongest collaborative or conceptual ties [22]. |
| Number of Relevant Terms | N/A | 500 | Limits the network to the most discriminative and meaningful terms, preventing visual clutter [1]. |
Beyond construction parameters, VOSviewer allows for extensive customization of the network visualization through its configuration object, which can be stored in a JSON file. This is critical for tailoring the map for publication or presentation, such as using color schemes optimized for accessibility or highlighting specific environmental science clusters [22] [5].
Table 3: Key Configuration Parameters in the VOSviewer JSON File for Visualization Control
Config Parameter (within parameters object) |
Data Type | Description and Common Values | Visualization Impact |
|---|---|---|---|
scale |
Float | Zoom level of the visualization (≥1). A higher value zooms in. | Controls the overall visible area of the network [22]. |
item_size |
Integer | Reference for node size (1=first option, 2=second, etc.). | Influences the relative prominence of nodes [22]. |
item_color |
Integer | Determines what property defines node color (1=cluster, 2=score, etc.). | Critical for highlighting clusters or temporal trends (overlays) [22]. |
score_colors |
String | Color scheme for score-based overlays (e.g., 'Viridis', 'Plasma', 'Coolwarm'). |
'Viridis' is the new perceptually uniform default, better than the old rainbow scheme [5]. |
min_score / max_score |
Float | Defines the range for the score color legend. | Focuses the color contrast on a specific data range, enhancing interpretability [22]. |
attraction / repulsion |
Integer | Force-directed layout parameters controlling node spacing. | Adjusts the layout density and clarity; higher repulsion spreads nodes apart [22]. |
resolution |
Float | Controls the cluster detection granularity (default=1.0). | A higher value typically results in a larger number of smaller, more specific clusters [22]. |
The entire process of building a network map in VOSviewer, from data collection to interpretation, follows a structured workflow. The diagram below illustrates the key decision points and procedural steps, highlighting the parallel paths for constructing co-authorship versus term co-occurrence networks.
The network visualization in VOSviewer is governed by a structured data model, particularly when using the JSON file format. This model defines how items (nodes), links (edges), and their visual properties are stored and interrelated. The following diagram depicts the core structure of this data model, which is essential for advanced users who wish to customize or programmatically generate network files.
Mastering the construction of bibliometric networks with VOSviewer provides environmental science researchers with a powerful analytical framework for navigating the expansive and complex body of scientific literature. By adhering to the detailed protocols for data preparation, network construction, and visualization configuration outlined in this guide, researchers can systematically generate evidence-based maps that reveal the structural dynamics of their field. The transition from raw data to an insightful visual map demystifies the research landscape, enabling the identification of knowledge gaps, emerging frontiers, and collaborative opportunities. As with any methodological tool, proficiency comes with practice. Researchers are encouraged to apply these protocols to their own domains, using the structured workflows and configuration options to build, refine, and interpret maps that advance their specific research objectives and contribute to the broader progress of environmental science.
VOSviewer is a powerful software tool for constructing and visualizing bibliometric networks, enabling researchers to discern complex patterns within large sets of academic literature. Within environmental science research, these visualizations help identify trending topics, collaborative networks, and emerging research frontiers. The software primarily generates three types of visualizations: network, overlay, and density views, each providing distinct analytical perspectives on bibliometric data. These maps can be constructed based on citation networks, bibliographic coupling, co-citation relationships, or co-authorship patterns, with additional text mining functionality to visualize co-occurrence networks of important terms extracted from scientific literature [21].
Table 1: Comparative Characteristics of VOSviewer Visualization Techniques
| Feature | Network View | Overlay View | Density View |
|---|---|---|---|
| Primary Function | Displays relationships and connections between items | Superimposes temporal or thematic information on network structure | Highlights area concentration and impact of research topics |
| Visual Elements | Nodes (circles) and links (lines) | Colored nodes over standard network layout | Color gradients from blue (low density) to red (high density) |
| Color Significance | Typically indicates cluster affiliation | Indicates specific properties (e.g., publication year, citation impact) | Indicates concentration and importance of topics |
| Interpretation Focus | Cluster identification, relationship strength | Temporal evolution, performance metrics | Research dominance, emerging hotspots |
| Environmental Science Application | Research theme identification, collaboration patterns | Tracking topic evolution, impact assessment | Identifying established vs. emerging research areas |
Table 2: Quantitative Parameters for VOSviewer Visualization Optimization
| Parameter | Network View | Overlay View | Density View |
|---|---|---|---|
| Node Size Scaling | Proportional to citation count or publication volume | Proportional to specific metric (e.g., recent citations) | Fixed size with density-based coloring |
| Cluster Resolution | 0.60-1.00 (moderate to high for clear separation) | 0.40-0.80 (lower to maintain base structure) | Not applicable |
| Minimum Cluster Size | 5-15 items for meaningful grouping | 5-15 items aligned with base network | Not applicable |
| Attraction Parameter | 1.5-3.0 for optimal layout | 1.5-3.0 (matches base network) | Not applicable |
| Repulsion Parameter | 0.0-1.0 for cluster separation | 0.0-1.0 (matches base network) | Not applicable |
| Label Size | Proportional to item importance or fixed for readability | Proportional to overlay metric | Minimal or no labels |
| Color Saturation | High for cluster distinction | Gradient based on time or performance metric | Continuous scale from blue to red |
Purpose: To create a network visualization mapping research themes in environmental science.
Materials and Reagents:
Methodology:
Purpose: To superimpose temporal evolution on bibliometric networks.
Materials and Reagents:
Methodology:
Purpose: To identify research hotspots and concentration areas in environmental science.
Materials and Reagents:
Methodology:
Table 3: Essential Research Reagents for VOSviewer Bibliometric Analysis
| Reagent/Material | Function | Specifications | Application Context |
|---|---|---|---|
| VOSviewer Software | Primary tool for constructing and visualizing bibliometric networks | Latest version; compatible with Windows, Mac, Linux; requires Java Runtime Environment | All visualization types: network, overlay, and density views [21] |
| Bibliographic Databases | Source of raw data for analysis | Scopus, Web of Science, or PubMed; RIS/CSV export format; inclusive of abstracts and citations | Data extraction for co-occurrence, citation, and co-authorship analysis |
| Normalization Algorithms | Standardize network measurements for fair comparison | Association strength, fractionalization, or clustering-based approaches | Network visualization to account for varying citation practices across fields |
| Clustering Techniques | Group related items into thematic clusters | Resolution parameter 0.60-1.00; minimum cluster size 5-15 items | Network view to identify research themes in environmental science |
| Color Gradients | Visual representation of temporal or impact metrics | Time-based: blue (older) to yellow (recent); Impact-based: blue (low) to red (high) | Overlay visualization to show evolution of research topics |
| Density Smoothing | Enhance visual interpretation of concentration areas | Kernel-based smoothing with adjustable bandwidth (10-20) | Density view to identify research hotspots without clutter |
| Layout Algorithms | Optimize spatial arrangement of network elements | VOS scaling, modularity-based, or force-directed approaches | All visualization types to minimize overlapping and improve readability |
The concept of urban resilience has evolved significantly from its ecological origins in the 1970s to become a critical framework for addressing modern urban challenges including climate change, natural disasters, and public health emergencies [23] [24]. This application note provides a structured protocol for conducting bibliometric analysis of resilient cities research using VOSviewer software, enabling researchers to quantitatively map the intellectual landscape and identify emerging trends in this rapidly growing field. The methodology outlined here facilitates objective assessment of research publications, authors, institutions, and conceptual themes within resilient cities literature, supporting evidence-based decision-making in urban policy and planning [23] [25].
Database Selection and Search Strategy
Data Screening and Cleaning
Network Construction Parameters in VOSviewer
Visualization Optimization
Table 1: Publication Trends in Resilient Cities/Communities Research (1995-2022)
| Time Period | Stage Classification | Annual Publications Range | Cumulative Publications | Key Influencing Factors |
|---|---|---|---|---|
| 1995-2004 | No Attention Period | 0-2 publications annually | <10 total | Limited conceptual development; minimal practical application |
| 2005-2014 | Starting Period | Steady growth with minor fluctuations | ~100 total | UNISDR "Making Cities More Resilient" campaign (2010) |
| 2015-2022 | Rapid Growth Period | 213 publications by 2021 | 1148 total | Rockefeller Foundation 100 Resilient Cities (2013); climate change urgency; COVID-19 pandemic |
Analysis of publication trends reveals three distinct developmental phases in resilient cities research [23]. The field remained largely dormant until 2004, experienced gradual growth following international resilience campaigns, and entered a period of rapid expansion after 2014, largely driven by major global initiatives and escalating climate concerns [23] [24]. The acceleration phase corresponds with the launch of the Rockefeller Foundation's "100 Resilient Cities" program in 2013, which significantly stimulated research interest and output [24].
Table 2: Key Research Sources and Contributors in Resilient Cities Research
| Category | Top Elements | Quantitative Metrics | Significance/Focus |
|---|---|---|---|
| Core Journals | Sustainability | 73.2% of publications as articles [23] | Primary outlet for resilient cities research |
| International Journal of Disaster Risk Reduction | High citation frequency [23] | Focus on practical disaster mitigation strategies | |
| Leading Authors | Serre | Most productive author [23] | Expertise in infrastructure resilience and risk management |
| Shaw | High publication output [23] | Contributions to community and social resilience | |
| Institutional Leaders | Colorado State University | Leading research institution [23] | Interdisciplinary resilience research center |
| Delft University of Technology | Prominent European institution [23] | Water management and climate adaptation expertise | |
| Texas A&M University | Major contributor [23] | Community and regional resilience focus | |
| Geographical Distribution | United States | Leading country [23] | Extensive research funding and institutional support |
| Global North Countries | Majority of publications [23] [26] | Disproportionate research output compared to implementation needs |
Journal analysis indicates that resilient cities research is published across interdisciplinary platforms, with Sustainability and International Journal of Disaster Risk Reduction serving as primary venues [23]. The United States maintains dominance in research output, though the Global South represents critical areas for implementation and case studies [26].
Table 3: Essential Research Tools and Data Sources for Resilient Cities Bibliometric Analysis
| Tool/Resource | Type/Function | Specific Application in Resilient Cities Research | Access Method |
|---|---|---|---|
| VOSviewer Software | Bibliometric visualization tool | Constructing and visualizing networks of journals, researchers, publications based on citation and co-authorship relations [6] | Free download from VOSviewer.com |
| Web of Science Core Collection | Scientific literature database | Primary data source for comprehensive publication records on resilient cities; enables precise bibliometric queries [23] [24] | Institutional subscription required |
| CiteSpace | Complementary visualization software | Analyzing citation networks and identifying emerging trends through burst detection [24] | Free Java application |
| 100 Resilient Cities Strategies | Policy document collection | Content analysis of urban resilience plans; evaluating equity and justice considerations in planning [26] | Publicly available from Rockefeller Foundation |
| City Resilience Program (CRP) Data | Geospatial urban data | Spatial analysis of resilience challenges; mapping population exposure to hazards [27] | World Bank/GFDRR partnership |
| ArcGIS StoryMaps | Geospatial storytelling platform | Creating interactive visualizations of urban resilience indicators and spatial relationships [27] | Esri platform subscription |
Bibliometric analysis of resilient cities research reveals several dominant thematic clusters that have emerged over the past decade [23] [24]. The conceptual foundation has evolved through three distinct phases: engineering resilience (focusing on return to stable state), ecological resilience (emphasizing adaptation and multiple stable states), and evolutionary resilience (highlighting transformation and adaptive capacity) [24]. Current research hotspots identified through keyword analysis include:
Data Comprehensiveness
Interpretation Challenges
This protocol provides a comprehensive framework for conducting bibliometric analysis of resilient cities research using VOSviewer. The systematic approach to data collection, processing, and visualization enables researchers to identify knowledge gaps, track conceptual evolution, and map collaborative networks within this interdisciplinary field. Application of these methods supports strategic research planning, evidence-based policy development, and identification of potential collaborators across institutions and geographical regions [23] [24]. As urban resilience challenges continue to evolve in complexity and scale, bibliometric analysis offers a valuable tool for navigating the expanding knowledge domain and directing future research toward areas of greatest need and potential impact.
Microplastics (MPs), plastic particles less than 5 mm in size, have emerged as a global environmental contaminant of significant concern, threatening aquatic and terrestrial ecosystems worldwide [4]. The rapid expansion of microplastic research over the past decade has created an extensive body of literature that requires systematic analysis to identify research trends, knowledge gaps, and emerging frontiers. This case study employs VOSviewer software for bibliometric analysis to map the intellectual landscape and research hotspots in microplastic pollution studies. By quantitatively analyzing publication patterns, collaborations, and keyword relationships, we provide researchers with a comprehensive overview of the field's structure and evolution, supporting strategic research planning and resource allocation in environmental science.
Bibliometric analysis requires systematic data collection to ensure comprehensive coverage of the research domain. The following protocol outlines the standardized approach for data retrieval:
TS=(microplastic OR microplastics OR "plastic debris" OR micro-plastic OR nanoplastics) AND TS=(marine OR sea OR ocean OR beach OR bay OR gulf OR estuary OR coastline OR shoreline) AND TS=(contamination OR pollution OR contaminate OR pollute OR stain OR filth OR contaminant OR foul) AND TS=(removal OR removal OR removed OR remove OR exenterate OR dispose OR expulsion OR erasing OR eliminate OR degradation OR degrade OR decomposition OR decompose OR degeneration OR hydrolysis OR degradable OR dissipation OR harness OR governance OR treatment OR control OR management OR government OR govern OR administration OR regulation) [30]The analytical workflow employs specialized bibliometric software to process and visualize literature data:
Figure 1: Bibliometric Analysis Workflow for Microplastic Research
Microplastic research has experienced exponential growth over the past decade, reflecting increasing global concern about plastic pollution:
Table 1: Annual Publication Trends in Microplastic Research (2013-2022)
| Year | Cumulative Publications | Annual Publications | Key Driving Events |
|---|---|---|---|
| 2013 | Baseline | Minimal | Growing scientific interest |
| 2014 | - | Rapid increase | - |
| 2015 | - | Continued growth | UN Sustainable Development Summit |
| 2019 | - | >1,000 annual publications | Increased global awareness |
| 2022 | 11,777 total [4] | 3,548 [4] | 30.12% of total analyzed publications |
Geographic analysis reveals concentrated research efforts in specific countries, with 147 countries having participated in microplastic pollution research [4]. The most productive countries include:
These countries not only demonstrate high research productivity but also maintain extensive international collaboration networks, facilitating global knowledge exchange on microplastic pollution.
Keyword co-occurrence analysis in VOSviewer reveals four primary research clusters in microplastic studies:
Table 2: Major Research Clusters in Microplastic Pollution Studies
| Research Cluster | Key Focus Areas | Specific Topics |
|---|---|---|
| Distribution & Sources [4] | Spatial patterns, input pathways | Marine vs. freshwater distribution, land-based sources, atmospheric transport, wastewater treatment plant effluent |
| Toxicological Effects [4] | Organism impacts, ecosystem risks | Ingestion by marine organisms, biological toxicity, food web transfer, physiological and behavioral effects |
| Analytical Methods [4] | Detection, quantification | Sampling techniques, identification methods, standardization needs, size fractionation |
| Adsorption & Interactions [4] | Pollutant interactions, vector effects | Adsorption with other pollutants, chemical additive leaching, persistent organic pollutants, metals |
The conceptual structure of microplastic research has evolved significantly over time, transitioning from initial focus on traceability and hazard analysis to broader examination of economic activities and synthetic fibers as major contributors to microplastic pollution [30]. Current research frontiers include microplastics in wastewater treatment plant effluent, human consumption impacts, synthetic textiles, and polymer degradation processes [30].
Comprehensive microplastic monitoring requires standardized field sampling protocols to ensure data comparability across studies:
Microplastic identification and characterization requires meticulous laboratory procedures to ensure accurate results:
Figure 2: Microplastic Laboratory Analysis Workflow
Contamination control is paramount in microplastic research due to ubiquitous presence of synthetic particles in laboratory environments:
Table 3: Essential Research Reagents and Materials for Microplastic Studies
| Item | Function/Application | Technical Specifications |
|---|---|---|
| Cellulose Nitrate Filters | Sample filtration | 0.45μm pore size [31] |
| Plankton Nets | Field sampling | 100μm mesh size [31] |
| Decon-90 | Equipment cleaning | Removes organic contaminants [31] |
| Glass Containers | Sample collection/storage | Pre-cleaned, foil-covered [31] |
| Micro-Raman Spectroscopy | Polymer identification | Molecular characterization [31] |
| Stereomicroscope | Visual identification | 10-40x magnification range [31] |
| Cotton Laboratory Apparel | Contamination control | 100% cotton material [31] |
Bibliometric analysis of microplastic research provides valuable insights for scientific advancement and policy development:
The hotspot analysis reveals critical areas requiring further investigation, including:
The research trends identified through bibliometric analysis support evidence-based decision making:
This case study demonstrates the powerful application of VOSviewer bibliometric analysis for mapping research hotspots in microplastic pollution science. The exponential growth in publications, shifting research frontiers, and emerging geographic collaborations highlighted through this analysis provide a strategic roadmap for researchers, funding agencies, and policy makers. The standardized protocols and methodological frameworks presented offer practical guidance for conducting environmentally relevant microplastic research. As the field continues to evolve, ongoing bibliometric monitoring will be essential for identifying new research directions, facilitating international collaboration, and effectively addressing the global challenge of microplastic pollution through evidence-based scientific approaches.
Co-occurrence networks are powerful text mining tools that visually map relationships between entities within large collections of documents. In scientific literature analysis, these networks reveal knowledge structures and research trends by representing co-occurring keywords, authors, or concepts as interconnected nodes within a graphic visualization [32]. Within environmental science research, where interdisciplinary work is prevalent, this method helps researchers identify emerging topics, collaborative networks, and thematic clusters across vast bibliographic datasets.
The fundamental principle involves identifying paired entities that appear together within specified text units (e.g., article keywords, titles, or abstracts). Each entity becomes a network node, and each co-occurrence forms a connection link [33]. The number of co-occurrences determines the connection strength, visually representing the cumulative knowledge of a scientific domain [33]. When integrated with specialized software like VOSviewer, this approach enables environmental scientists to extract meaningful patterns from complex literature corpora, thereby informing research directions and gap analyses.
Co-occurrence networks belong to the broader category of semantic networks that graphically visualize potential relationships between entities represented within written material [32]. The network construction process follows three systematic steps: (1) identifying relevant keywords or terms from the text corpus, (2) calculating frequencies of individual terms and their paired co-occurrences, and (3) analyzing the resulting networks to identify central terms and thematic clusters [32]. The construction criteria can be adjusted for specificity—for instance, requiring co-occurrence within the same sentence rather than merely within the same document for more precise relationship mapping.
In environmental science contexts, the network structure reveals specialized subdomains through distinct clustering patterns. For example, terms like "carbon sequestration," "soil organic matter," and "biochar" might form one cluster, while "microplastic pollution," "marine ecosystems," and "trophic transfer" form another. The interconnection density between these clusters indicates their conceptual relatedness within the research landscape, providing insights into knowledge integration across environmental subdisciplines.
Beyond visual inspection, co-occurrence networks employ quantitative metrics to extract meaningful insights. The table below summarizes key metrics used in network analysis:
Table 1: Key Metrics for Co-occurrence Network Analysis
| Metric | Calculation | Interpretation in Research Context |
|---|---|---|
| Node Degree | Number of connections to other nodes | Indicates popularity or centrality of a concept within the research field |
| Betweenness Centrality | Number of shortest paths passing through a node | Identifies bridging concepts that connect different research themes |
| Link Weight | Frequency of co-occurrence between two terms | Reflects strength of association between concepts |
| Modularity | Ability of network to decompose into modules | Measures delineation of distinct research themes or subfields |
| Strength | Sum of weights of all links connected to a node | Combines node connectivity and association strength |
These metrics enable environmental scientists to move beyond simple term frequency counts to understand the structural role of specific concepts within the research landscape. For instance, a term with high betweenness centrality might represent an integrative concept connecting traditionally separate subfields like "environmental justice" bridging toxicology and policy studies [33].
VOSviewer (Visualization Of Similarities viewer) is a specialized software tool developed by researchers at Leiden University's CWTS for constructing and visualizing bibliometric networks [21]. Its design specifically addresses the needs of scientific mapping, offering functionality for creating co-authorship networks, term co-occurrence maps, and citation-based networks from major bibliographic databases [21]. For environmental scientists, VOSviewer provides accessible entry into complex network analysis without requiring advanced programming skills.
The software accepts multiple data formats, including direct imports from Web of Science, Scopus, and PubMed/Medline, streamlining the initial data processing stages [1]. Additionally, VOSviewer offers text mining functionality that can construct co-occurrence networks from important terms extracted from scientific literature bodies, making it particularly valuable for emerging research domains where keyword standardization may still be evolving [21].
While VOSviewer offers user-friendly network visualization, other software tools provide complementary functionalities. CiteSpace enables temporal analysis of research trends, particularly valuable for tracking the evolution of environmental science concepts [34]. Network Workbench provides additional statistical analysis capabilities for larger datasets [33]. The integration of these tools creates a comprehensive analytical suite for sophisticated bibliometric research.
Table 2: Software Tools for Co-occurrence Network Analysis
| Software | Primary Function | Environmental Science Application |
|---|---|---|
| VOSviewer | Network visualization and clustering | Identifying research themes and collaborative patterns |
| CiteSpace | Temporal trend analysis and burst detection | Tracking emerging concepts in environmental policy |
| Network Workbench | Large-scale network statistics | Analyzing cross-disciplinary connections |
| Gephi | Advanced network manipulation and visualization | Creating publication-ready network maps |
Environmental science research faces particular challenges in synthesizing knowledge across multiple disciplines and addressing rapidly evolving issues like climate change impacts and ecosystem management. Co-occurrence network analysis addresses these challenges by mapping the conceptual structure of environmental research domains. For this case study, we simulate a bibliometric analysis investigating interconnections between "nanomaterial environmental safety" and "regulatory science" research—a domain with significant policy implications where VOSviewer has demonstrated utility in previous studies [33].
The analysis aims to: (1) identify major research themes within nanomaterial environmental risk literature, (2) detect emerging topics and temporal trends, and (3) visualize collaborative networks among researchers and institutions. This approach facilitates evidence-based literature reviews by providing data-driven insights before undertaking more rigorous systematic reviews [33].
Data Source Selection and Search Strategy
TS=("nano*" AND "risk assessment") OR ("nano*" AND "environment* safety") OR ("nano*" AND "ecotoxic*")Data Cleaning and Standardization
The following diagram illustrates the complete workflow for creating co-occurrence networks from scientific literature:
Step 1: Data Import and Network Type Selection
Step 2: Counting Method and Normalization
Step 3: Network Visualization and Customization
Step 4: Analysis and Interpretation
Table 3: Essential Research Reagents for Co-occurrence Network Analysis
| Tool/Resource | Function in Analysis | Application Example |
|---|---|---|
| Web of Science Core Collection | Primary bibliographic data source | Comprehensive coverage of environmental science literature |
| VOSviewer Software | Network construction and visualization | Creating co-occurrence maps of sustainability research |
| Thesaurus File | Standardization of variant terms | Merging "climate change" and "global warming" references |
| CiteSpace | Temporal pattern analysis | Detecting emerging concepts in renewable energy research |
| Microsoft Excel | Data cleaning and preprocessing | Managing author institutional affiliations |
Beyond static network visualization, VOSviewer enables examination of research evolution through overlay visualizations that map scientific activity across time periods [34]. For environmental scientists, this functionality helps track conceptual shifts in response to policy developments or technological breakthroughs. The software color-codes nodes based on publication year, creating a chronological landscape of research focus.
Implementation involves:
Network clusters require rigorous interpretation to ensure they represent meaningful research themes rather than algorithmic artifacts. The following validation protocol enhances analytical robustness:
Quantitative Validation
Qualitative Validation
While powerful, co-occurrence network analysis presents several methodological challenges that environmental scientists must address:
Terminology Issues
Database Biases
Recent critical reviews caution against treating co-occurrence networks as direct representations of ecological relationships without proper validation [35]. Environmental scientists should supplement network findings with traditional review methods and expert consultation to avoid misinterpretation of algorithmic patterns.
Environmental science research presents unique challenges including terminological diversity, cross-disciplinary integration, and rapidly evolving concepts. The following optimization strategies address these challenges:
Domain-Specific Adaptations
Validation Frameworks
Co-occurrence network analysis represents a powerful methodology for mapping the complex, interdisciplinary landscape of environmental science research. When implemented through VOSviewer with appropriate methodological rigor, it enables researchers to identify emerging topics, track conceptual evolution, and visualize knowledge structures across vast literature corpora. The protocols outlined provide a comprehensive framework for environmental scientists to harness these techniques, from data collection through advanced temporal analysis.
As environmental challenges grow increasingly complex, such data-driven literature analysis methods become essential tools for research planning, gap identification, and interdisciplinary collaboration facilitation. By integrating these network approaches with domain expertise, environmental scientists can more effectively navigate the expanding research landscape and identify productive pathways for addressing pressing ecological issues.
In bibliometric analysis, particularly within environmental science research, the integrity of findings is fundamentally dependent on the quality of the underlying keyword data. Synonyms and keyword variations introduce significant noise, potentially skewing network maps, misrepresenting conceptual relationships, and compromising the validity of conclusions derived from tools like VOSviewer [36]. Environmental science is especially prone to this issue, encompassing a lexicon that includes "ecosystem services," "ecological goods," "ecological products," and "environmental degradation," often used interchangeably across different studies and schools of thought [36] [16]. This application note establishes a standardized protocol for the cleaning and harmonization of keyword data, ensuring that subsequent bibliometric visualizations and analyses accurately reflect the true intellectual structure of the research landscape.
The challenge of keyword variability is not merely anecdotal but is quantifiable in the literature. The following table summarizes key aspects of keyword dynamics identified in bibliometric studies:
Table 1: Quantitative Insights into Keyword Dynamics in Scientific Literature
| Aspect | Quantitative Finding | Source Context |
|---|---|---|
| High-Frequency Environmental Keywords | "ecosystem services", "valuation", "biodiversity", "management", "conservation" are high-frequency, high-centrality terms [36]. | Analysis of international research on ecological product value. |
| Primary Research Drivers | Economic growth, renewable energy, and the Environmental Kuznets Curve are dominant themes [16]. | Bibliometric analysis of environmental degradation research (1,365 papers). |
| Keyword Repetitiveness as a Specialization Metric | Proposed Sj index measures journal specialization as the average frequency of a keyword's appearance in a journal [37]. | Study of keyword occurrences across 88,583 articles in 50 journals. |
These findings underscore that effective data cleaning must go beyond simple merging of obvious duplicates. It requires an understanding of the thematic context—recognizing that "carbon emission" and "CO2" are functionally identical in many environmental studies [16]—and an awareness of the level of conceptual granularity, where broad terms like "management" coexist with specific ones like "contingent valuation" [36].
This protocol provides a step-by-step methodology for preprocessing a raw keyword dataset exported from databases like Web of Science or Scopus before import into VOSviewer.
Table 2: Essential Research Reagent Solutions for Bibliometric Data Cleaning
| Item Name | Function/Description |
|---|---|
| Raw Bibliometric Data | The initial dataset, typically in .txt or .csv format, containing author keywords, KeyWords Plus, titles, and abstracts. |
| Data Preprocessing Software | A tool for bulk text manipulation (e.g., Python with Pandas, R, OpenRefine, or even advanced Excel functions). |
| Taxonomy/Thesaurus File | A custom-built list defining groups of synonymous terms and their preferred standardized label (e.g., "CO2" -> "carbon emissions"). |
| VOSviewer Software | The bibliometric analysis and visualization tool for which the data is being prepared [36] [16]. |
Step 1: Data Acquisition and Initial Preprocessing
Step 2: Building a Custom Environmental Science Harmonization Taxonomy
Step 3: Automated Term Harmonization
Step 4: Validation and Iteration
The following workflow diagram illustrates the logical sequence and decision points of this protocol:
Applying this protocol within environmental science requires domain-specific knowledge. The table below provides illustrative examples of synonym groups pertinent to this field, derived from bibliometric research.
Table 3: Exemplary Keyword Harmonization for Environmental Science Bibliometrics
| Standardized Preferred Label | Common Synonyms and Variations to Map |
|---|---|
| Ecosystem Services | Ecological services, environmental services, ecosystem goods, ecological products [36]. |
| Carbon Emissions | CO2, carbon dioxide, CO2 emissions, carbon emission [16]. |
| Environmental Kuznets Curve | EKC, Kuznets curve [16]. |
| Renewable Energy | Green energy, alternative energy, sustainable energy [16]. |
| Valuation | Economic valuation, contingent valuation, ecosystem service valuation [36]. |
| Biodiversity | Biological diversity, species richness [36]. |
Failure to implement this cleaning process can lead to misleading visualizations in VOSviewer. For instance, "CO2" and "carbon emissions" would appear as distinct, unconnected nodes in a co-occurrence network, artificially fragmenting the research domain and obscuring the true centrality of this topic [16]. Harmonizing these terms ensures that the resulting map accurately conveys the collective scholarly focus on this critical aspect of environmental degradation.
Rigorous data cleaning is the indispensable foundation of any reliable bibliometric analysis. The systematic protocol outlined here for handling synonyms and keyword variations empowers researchers to transform noisy, inconsistent raw data into a structured and valid dataset. By adopting these best practices, environmental scientists can leverage VOSviewer to generate more accurate, interpretable, and trustworthy maps of their research landscape, thereby providing a solid evidence base for scientific insight and policy decision-making.
In the field of environmental science research, bibliometric analysis using VOSviewer has become an indispensable methodology for mapping the intellectual landscape, identifying emerging trends, and understanding collaborative networks. The software enables researchers to construct and visualize various bibliometric networks, including co-authorship, co-citation, and keyword co-occurrence networks [20]. However, the creation of meaningful and interpretable maps requires careful consideration of threshold settings, which directly determine the balance between analytical detail and visual clarity. Proper threshold selection ensures that visualizations highlight the most significant patterns without becoming cluttered with irrelevant information, making it a critical skill for researchers, scientists, and drug development professionals working with complex environmental datasets.
This application note provides a comprehensive framework for selecting appropriate thresholds in VOSviewer, with specific considerations for environmental science research. We present detailed protocols, quantitative guidelines, and visualization strategies to help researchers optimize their bibliometric maps for maximum analytical value and communicative power.
Thresholds in VOSviewer function as filtration mechanisms that determine which elements (items, links, clusters) appear in the final visualization. These parameters are essential for managing visual complexity while maintaining analytical integrity. The software employs several types of thresholds that operate on different attributes of the bibliometric data, each serving a distinct purpose in the map refinement process.
The item weight threshold controls which items (authors, keywords, journals, countries) appear in the visualization based on quantitative metrics such as publication count, citation count, or total link strength [38]. Items falling below this threshold are excluded from the map, allowing researchers to focus on the most significant elements. The link strength threshold determines which connections between items are displayed, filtering out weaker associations that might contribute to visual noise without adding substantive analytical value [22]. Additionally, VOSviewer incorporates cluster resolution parameters that influence how items are grouped together, with higher resolution values typically resulting in more numerous, finer-grained clusters [22].
Based on analysis of published bibliometric studies in environmental science and related fields, the following table provides recommended threshold ranges for different types of analyses. These values serve as starting points that should be refined based on specific research questions and dataset characteristics.
Table 1: Recommended Threshold Ranges for Environmental Science Bibliometrics
| Analysis Type | Dataset Size | Minimum Item Weight | Minimum Link Strength | Resolution Parameter |
|---|---|---|---|---|
| Co-authorship | 500-2,000 items | 2-5 documents | 1-2 co-authored papers | 1.0-1.5 |
| Keyword Co-occurrence | 3,000-10,000 items | 5-10 occurrences | 3-5 co-occurrences | 1.2-1.8 |
| Citation | 1,000-5,000 items | 5-15 citations | 2-4 citation links | 0.8-1.2 |
| Country Collaboration | 50-100 countries | 1-2 collaborative papers | 1-2 collaborative links | 1.0-1.5 |
Purpose: To establish an optimized threshold setting for keyword co-occurrence analysis in environmental science research, balancing comprehensive coverage with visual interpretability.
Materials and Reagents:
Method Details:
Data Extraction and Preparation
Initial Threshold Setting
Iterative Refinement
Validation and Documentation
Purpose: To identify optimal thresholds for mapping collaboration networks in environmental science research communities.
Materials and Reagents:
Method Details:
Data Collection and Cleaning
Threshold Configuration
Network Refinement
Interpretation and Analysis
The following diagram illustrates the threshold optimization workflow for bibliometric analysis in VOSviewer:
Workflow for Threshold Optimization in VOSviewer
Successful bibliometric analysis in environmental science requires both specialized software and methodological knowledge. The following table details essential components of the bibliometric analysis toolkit, with particular emphasis on their application to environmental research questions.
Table 2: Research Reagent Solutions for VOSviewer Bibliometric Analysis
| Tool/Resource | Function | Application in Environmental Science |
|---|---|---|
| VOSviewer Software | Network visualization and analysis | Creates interpretable maps of research trends, collaborations, and conceptual structure in environmental science [20]. |
| Web of Science Core Collection | Primary data source | Provides comprehensive bibliographic data with consistent indexing, essential for tracking environmental research outputs [39]. |
| CiteSpace | Complementary analysis tool | Identifies emerging trends and burst concepts in environmental science literature when used with VOSviewer [39]. |
| Custom Thesauri | Term normalization | Standardizes variant environmental terminology (e.g., "global warming" vs. "climate change") for more accurate mapping. |
| JSON Configuration Files | VOSviewer parameter storage | Saves and shares optimal threshold settings for specific environmental science domains [22]. |
Threshold Strategy for Temporal Analysis:
The relationship between threshold parameters and visualization characteristics follows predictable patterns that can guide decision-making:
Threshold Impact on Map Characteristics
Environmental science increasingly intersects with other disciplines, creating challenges for threshold selection due to terminological diversity. For multidisciplinary topics like "One Health" or "Planetary Boundaries," consider these specialized approaches:
Staged Thresholding:
Cluster-Based Threshold Adjustment:
Cross-Database Validation:
Threshold selection in VOSviewer represents both a technical and conceptual challenge that directly influences the analytical value of bibliometric visualizations in environmental science research. The protocols and guidelines presented in this application note provide a systematic approach to balancing map detail and clarity, enabling researchers to generate visualizations that are both comprehensive and interpretable. As environmental science continues to evolve as an interdisciplinary field, appropriate threshold management becomes increasingly important for identifying emerging research trends, collaboration patterns, and knowledge structures. By applying these evidence-based threshold strategies, researchers can enhance the rigor and communicative power of their bibliometric analyses, ultimately supporting more informed decisions in research planning and policy development.
Bibliometric analysis has become an indispensable methodology in environmental science research, enabling the systematic mapping of knowledge domains and emerging trends. VOSviewer has emerged as a dominant tool in this landscape, distinguished by its powerful network visualization capabilities and user-friendly interface for creating "distance-based maps" where the proximity between items accurately reflects their similarity [40]. Despite its widespread adoption across various environmental research domains, including environmental security management and ecological risk assessment, researchers must acknowledge and develop strategies to address two significant analytical limitations: the lack of stemming functionality in term analysis and the inability to perform native temporal analysis [40]. These constraints present particular challenges in environmental science, where terminology evolves rapidly and understanding temporal trends is crucial for tracking emerging pollutants, policy impacts, and technological developments.
Stemming refers to the text processing technique that reduces words to their root form, allowing related terms to be analyzed as a single conceptual unit. VOSviewer's inability to perform stemming presents significant challenges in environmental science bibliometrics, where terminology frequently appears in variant forms.
Impact on Environmental Science Research:
Table 1: Common Environmental Science Terminology Affected by Lack of Stemming
| Root Concept | Variant Forms | Research Impact |
|---|---|---|
| Ecosystem Service | service, services | Fragmented analysis of key sustainability concepts |
| Climate Change | changing, changed climate | Incomplete assessment of research themes |
| Risk Assessment | assessing, assessed risk | Disconnected risk management literature |
| Environmental Security | security, securities | Compromised mapping of safety research domains |
| Pollution | pollute, polluted, polluting | Underestimation of pollution research volume |
VOSviewer lacks native capabilities for analyzing how research domains evolve, a critical limitation for environmental science where understanding temporal patterns is essential for tracking emerging contaminants, policy impacts, and technological adoption. While the software provides overlay visualizations, it cannot perform sophisticated time-series analysis or detect emerging trends algorithmically [40].
Consequences for Environmental Research:
Objective: Implement a standardized text preprocessing workflow to compensate for VOSviewer's lack of stemming functionality.
Materials and Software:
Methodology:
Objective: Establish a reproducible methodology for integrating temporal dimension into VOSviewer analyses using complementary bibliometric tools.
Materials and Software:
Methodology:
Table 2: Temporal Analysis Workflow for Environmental Science Research
| Research Phase | VOSviewer Application | Complementary Tool | Outcome Metrics |
|---|---|---|---|
| Data Preparation | Keyword co-occurrence per time period | Bibliometrix for conceptual structure map | Thematic evolution patterns |
| Trend Identification | Overlay visualization with publication year | CiteSpace for burst detection | Citation burst strength & duration |
| Network Evolution | Citation network analysis | SciMAT for thematic evolution | Thematic stability, emergence, disappearance |
| Research Front Analysis | Bibliographic coupling | CiteSpace for time-zone visualization | Research front progression |
A 2023 study on international environmental security management exemplifies the effective integration of VOSviewer with complementary tools to overcome its inherent limitations [41]. The research analyzed 7,596 articles from Web of Science spanning 1997-2021, forming six main clustering labels from 28,144 authors.
Stemming Compensation Approach: Researchers implemented manual terminology harmonization for environmental security concepts, grouping variants like "environmental safety" and "environmental security" through pre-processing before VOSviewer analysis. This enabled more accurate identification of research hotspots spanning personal health, society, agriculture, ecological environment, energy, and sustainable development.
Temporal Analysis Integration: While VOSviewer mapped the current research landscape and collaboration networks, CiteSpace provided critical temporal analysis through:
The hybrid methodology revealed that the United States maintains a dominant position in this research field, with China showing increasing collaboration with the United States, Britain, Australia, and India [41].
Table 3: Essential Analytical Tools for Comprehensive Bibliometric Analysis
| Tool/Software | Primary Function | Application in Environmental Science | Access |
|---|---|---|---|
| VOSviewer | Network visualization and clustering | Mapping research domains and collaboration networks | Free download |
| CiteSpace | Temporal and burst analysis | Identifying emerging trends and research fronts | Free download |
| Bibliometrix | Comprehensive bibliometric analysis | Performance analysis and science mapping | R package |
| Python NLTK | Text preprocessing and stemming | Terminology standardization pre-VOSviewer | Open source |
| Google Sheets | Data preprocessing and harmonization | Manual terminology cleaning | Web-based |
Based on experimental protocols and case studies, we propose a comprehensive workflow that compensates for VOSviewer's limitations while leveraging its strengths in visualization.
For Terminology Management:
For Temporal Analysis:
VOSviewer remains an invaluable tool for environmental science bibliometrics, particularly for its sophisticated visualization capabilities and user-friendly interface. However, researchers must acknowledge its limitations in stemming and temporal analysis, particularly in a field characterized by evolving terminology and pressing needs to understand temporal trends. The experimental protocols and integrated workflow presented here provide a robust methodology for compensating these limitations while leveraging VOSviewer's strengths. Through strategic complementarity with other bibliometric tools and careful attention to text preprocessing, environmental scientists can overcome these constraints to produce more accurate, comprehensive, and temporally-sensitive analyses of their research domains. This approach enables more effective mapping of the complex, evolving landscape of environmental science research, from ecosystem service-based risk assessment to emerging contaminants and sustainability transitions.
Within the framework of a broader thesis on the application of VOSviewer for bibliometric analysis in environmental science research, the ability to optimize map layouts and clustering is paramount. Effective visual presentation transforms complex network data into interpretable knowledge landscapes, enabling researchers to identify key themes, track emerging trends, and communicate findings clearly to diverse audiences, including scientists and drug development professionals. VOSviewer is specifically designed to construct and visualize various bibliometric networks based on scientific literature, facilitating the creation of co-occurrence, co-authorship, and citation networks [20]. The software's functionality allows for the visual exploration of patterns within textual and bibliographical data, making it an indispensable tool for mapping scientific fields [20]. This document provides detailed application notes and protocols for maximizing the effectiveness of VOSviewer's layout and clustering algorithms, with examples contextualized for environmental science.
Before optimizing a map, one must understand the type of network being analyzed. VOSviewer can construct several network types, each offering a different perspective on the scholarly landscape. The choice of network dictates the relationships that will be visualized and clustered.
Table: Common Bibliometric Network Types in VOSviewer
| Network Type | Defining Relationship | Research Insight Provided | Example from Environmental Science |
|---|---|---|---|
| Co-occurrence | Terms (e.g., keywords) appearing together in the same publication [42]. | The conceptual structure and main topics of a field. | Mapping the co-occurrence of terms like "microplastics," "bioaccumulation," and "ecotoxicology." |
| Co-authorship | Researchers or institutions collaborating on publications [42] [20]. | Collaborative networks and key partners. | Visualizing international collaboration on "carbon sequestration" research. |
| Citation | Documents or journals citing one another [42]. | The flow of knowledge and influence between entities. | Analyzing which foundational papers on "green synthesis" are most cited in recent drug development. |
| Bibliographic Coupling | Documents that share common references [42]. | Current research fronts working on similar problems. | Identifying groups of recent studies on "pharmaceutical pollution" that build on the same knowledge base. |
| Co-citation | Documents or journals being cited together by other documents [42]. | The intellectual foundations and seminal works of a field. | Revealing the core set of historical studies that underpin modern "environmental impact assessment." |
This protocol details the creation of a term co-occurrence network, one of the most common approaches for mapping a research field's conceptual structure.
1. Data Source Identification and Export:
TS=("green synthesis" AND "drug development" AND "environmental impact").2. Data Import and Network Construction in VOSviewer:
Create.Create a map based on bibliographic data and then Read data from bibliographic database files.Type of analysis, select Co-occurrence and then All keywords (or Author keywords for a more focused map).3. Map Initialization:
Finish to allow VOSviewer to calculate the network and generate an initial, unoptimized map.The default map layout often requires refinement to improve its interpretability. VOSviewer uses a visualization-based similarity technique to layout items [20].
1. Cluster-Colored Layout:
Items -> Clusters -> Use colors for clusters option. This assigns a distinct, high-contrast color to each cluster, making it immediately visible which items belong together.2. Label and Size Adjustment:
Label menu, adjust the Size and Scale parameters. Increase the minimum label size to ensure readability of key items. Use the Max. number of labels setting to show labels only for the most important items, preventing a "hairball" appearance.3. Density Visualization:
Density view via the main toolbar. Adjust the Kernel width and Edge width in the Density menu to control the smoothness and prominence of the color layers.The following diagram illustrates the logical workflow for optimizing a VOSviewer map, from data loading to final visualization.
Clustering is the process of partitioning a network into distinct groups, or clusters, of closely related items. In VOSviewer, this is automated but can be guided by the user.
Overlay visualization is a powerful feature for displaying the temporal evolution of research topics [20].
1. Data Preparation:
2. Score Calculation:
Overlay menu, select the Score tab. Choose the Publication years attribute.3. Visualization:
Table: Clustering Metrics and Their Impact on Map Interpretation
| Clustering Metric | Function | Impact on Thematic Analysis |
|---|---|---|
| Modularity | Measures the strength of division of a network into modules (clusters). | A high value indicates that the network has a strong community structure, validating the distinctness of the identified clusters. |
| Silhouette Score | Evaluates how similar an object is to its own cluster compared to other clusters. | A score close to 1 indicates items are well-matched to their own cluster and poorly-matched to neighboring clusters, confirming cluster cohesion and separation. |
| Number of Clusters | The total count of thematic groups identified by the algorithm. | A higher number requires more detailed interpretation but can reveal niche sub-fields. A lower number provides a high-level overview. |
| Average Publication Year (Overlay) | Colors items based on the temporal dimension of their activity [20]. | Directly reveals the evolution of the field, distinguishing between established core topics and emerging, trending research areas. |
This section details the essential digital "reagents" and tools required for conducting a VOSviewer-based bibliometric analysis.
Table: Essential Tools for VOSviewer Bibliometric Analysis
| Tool / Resource | Function | Role in the Analytical Workflow |
|---|---|---|
| VOSviewer Software | The primary application for constructing, visualizing, and exploring bibliometric networks [20]. | The core engine for data processing, layout calculation, clustering, and visualization. |
| Web of Science (WOS) Core Collection | A leading scholarly citation database used as the data source for robust bibliometric studies [34]. | Provides the raw, structured bibliographic data (titles, authors, abstracts, citations, etc.) required for analysis. |
| CiteSpace Software | Another scientometric software tool often used in conjunction with VOSviewer for complementary analyses like burst detection [34]. | Used for analyzing emerging trends and pivotal points in the literature, enriching the interpretation of VOSviewer maps. |
| Microsoft Excel | A ubiquitous spreadsheet application. | Used for preliminary data cleaning, managing exported records, and creating basic charts (e.g., annual publication trends) [34]. |
| VOSviewer Online | The web-based version of VOSviewer [42] [20]. | Allows for easy sharing and interactive exploration of created maps with collaborators or a wider audience, enhancing research dissemination. |
The relationship between these tools in a standard bibliometric workflow is outlined below.
VOSviewer is a freely available, Java-based software package specifically designed for constructing and visualizing bibliometric networks [43]. These networks can include relationships between a variety of scientific entities, such as journals, researchers, individual publications, and keywords, based on citation, co-citation, co-authorship, or co-occurrence data [44]. For researchers, scientists, and professionals in environmental science and drug development, bibliometric analysis has transitioned from a niche specialty to a fundamental preliminary research tool. It enables the mapping of vast scientific literatures to identify emerging trends, core publications, and collaborative networks. The software's user-friendly interface and its ability to create insightful, easy-to-interpret visualizations have contributed to its exponential growth in adoption, making it a critical component of the modern scientist's toolkit for literature discovery and research evaluation [43] [44].
VOSviewer supports several types of analysis, each serving a distinct purpose in exploring the scholarly landscape. Understanding these types allows researchers to select the appropriate methodology for their specific research question.
Table 1: Core Bibliometric Analysis Types in VOSviewer
| Analysis Type | Description | Primary Application in Environmental Science |
|---|---|---|
| Term Co-occurrence [44] | Identifies key thematic areas and their interlinkages by analyzing how frequently terms (e.g., keywords) appear together in publications. | Mapping the conceptual structure of a field like "microplastic pollution" to identify connected sub-themes such as "toxicity," "wastewater treatment," and "marine ecosystems." |
| Co-citation Analysis [44] | Identifies influential journals, references, and authors. A co-citation link exists between two items that are both cited by the same document. | Finding the foundational papers and key theorists in "environmental impact assessment" by seeing which references are consistently cited together. |
| Bibliographic Coupling [44] | Identifies countries, institutions, or publications working on similar topics. A link exists between two items that both cite the same document. | Revealing which European and Chinese research institutes are leading parallel research streams in "solar cell technology" development. |
The co-occurrence analysis is particularly valuable as a preliminary tool. It generates a network where the node size is proportional to the frequency of a term's co-occurrence, and the link thickness indicates the strength of the connection between two terms. Frequently co-occurring terms form distinct clusters, with each cluster typically representing a specific thematic research area [44]. This allows researchers to quickly grasp the intellectual structure of a field. For example, an exploratory study used VOSviewer to analyze keywords from the most cited papers on "smart cities" and "sustainable cities," revealing that they occupy largely distinct citation spaces, thus challenging the assumption that "smart cities" are inherently "sustainable cities" [43].
This protocol outlines the steps for using VOSviewer to perform an exploratory analysis of a research domain, using the example of investigating the intersection between remote sensing and the Sustainable Development Goals (SDGs) [43].
Table 2: Essential Materials and Software Tools
| Item Name | Function/Description | Source/Availability |
|---|---|---|
| Bibliographic Database | Source of publication and citation data. Provides metadata (titles, authors, abstracts, keywords, references) for analysis. | Scopus, Web of Science |
| VOSviewer Software | Java-based application for constructing, visualizing, and exploring bibliometric maps. | Freely available for download from https://www.vosviewer.com/ |
| Thesaurus File | A plain text file used to merge different variants of the same term (e.g., "color" and "colour," "SDG" and "Sustainable Development Goal"). | Created manually by the researcher based on knowledge of the field. |
Define Research Scope and Data Retrieval:
Data Preparation and Thesaurus Creation:
Perform Term Co-occurrence Analysis:
Interpretation and Visualization:
VOSviewer Analysis Workflow
VOSviewer can be used not just for exploration but also as a preliminary tool to test a simple hypothesis using publication data as a surrogate for real-world phenomena [43]. The workflow below outlines this advanced application.
Hypothesis Testing with VOSviewer
Protocol:
VOSviewer serves as a powerful and accessible tool for researchers seeking to navigate and understand complex scientific landscapes. Its value lies in its ability to transform large volumes of bibliographic data into intuitive visual maps, facilitating exploratory research and preliminary hypothesis testing. For environmental scientists and drug development professionals, mastering VOSviewer enables a data-driven approach to literature review, gap analysis, and research planning. By following the detailed protocols and utilizing the structured workflows outlined in this article, researchers can systematically integrate bibliometric analysis into their research process, thereby shaping more informed and impactful scientific inquiries.
Bibliometric analysis, facilitated by software like VOSviewer, provides powerful visualization of research landscapes through networks of citations, co-authorships, and term co-occurrences [6]. However, these computational outputs require rigorous validation through domain knowledge to ensure their scientific accuracy and practical relevance. Without proper validation, bibliometric findings risk representing statistical artifacts rather than genuine intellectual patterns. This protocol establishes comprehensive methodologies for validating VOSviewer-generated bibliometric maps within environmental science research, ensuring findings withstand scholarly scrutiny and provide meaningful insights for researchers, scientists, and drug development professionals.
The integration of quantitative bibliometric data with qualitative domain expertise creates a robust framework for interpreting complex research landscapes. This validation process is particularly crucial in environmental science, where research trends directly inform policy decisions and resource allocation. As demonstrated in a recent bibliometric analysis of environmental degradation research, which examined 1,365 publications, validation against domain knowledge helps ascertain whether frequently occurring terms like "economic growth" and "renewable energy" genuinely represent dominant research fronts rather than semantic artifacts [16].
Table: Key Validation Concepts in Bibliometric Analysis
| Concept | Definition | Validation Approach |
|---|---|---|
| Semantic Validation | Ensuring cluster labels accurately represent underlying concepts | Expert evaluation of term consistency and contextual relevance |
| Structural Validation | Verifying network relationships reflect genuine intellectual connections | Comparison with established citation classics and review articles |
| Temporal Validation | Confirming observed trends align with historical developments | Longitudinal analysis against known scientific milestones |
| Methodological Validation | Assessing appropriateness of visualization parameters | Sensitivity analysis of clustering resolution and correlation thresholds |
Bibliometric visualizations can mislead through several mechanisms. The rainbow color scheme previously used in VOSviewer, while visually appealing, could implicitly suggest non-existent categorical boundaries or obscure details in certain data ranges due to perceptual non-uniformity [5]. Version 1.6.7 replaced this with perceptually uniform color schemes like viridis, but interpretation challenges remain. Cluster boundaries may suggest discrete research areas when reality involves continuous intellectual gradients. Co-occurrence networks might reflect terminology preferences rather than conceptual relationships. Citation patterns can be influenced by disciplinary conventions rather than intellectual influence.
Table: Quantitative Metrics for Cluster Validation
| Metric | Calculation Method | Validation Threshold | Interpretation Guide |
|---|---|---|---|
| Cluster Silhouette Score | Average distance between items in same cluster vs. other clusters | >0.5 indicates strong clustering | Scores <0.25 suggest weak or artificial groupings |
| Term Consistency Ratio | Ratio of domain-relevant to generic terms within cluster | >60% for validated clusters | High generic term ratio indicates potential false cluster |
| Temporal Coherence Index | Standard deviation of publication years within cluster | <3 years for emerging topics | High deviation may indicate thematically disparate items |
| Citation Density Balance | Ratio of internal to external citations | >1.0 for well-defined clusters | Low ratios suggest fragmented intellectual foundations |
In the recent analysis of 1,365 environmental degradation publications, validation confirmed that "economic growth" represented a genuine research focus rather than a semantic artifact, evidenced by its central positioning across multiple visualization techniques and consistent co-occurrence with established theoretical frameworks like the Environmental Kuznets Curve [16]. The study demonstrated an annual publication growth rate exceeding 80%, with validated research fronts including renewable energy, urbanization drivers, and technological solutions – all confirming known domain priorities through quantitative measures.
Purpose: To verify that VOSviewer-generated cluster labels accurately represent the intellectual content of publications within each cluster.
Materials: VOSviewer cluster visualization output, list of publications per cluster, domain expert panel (3-5 experts), standardized evaluation forms.
Methodology:
Validation Criteria: Cluster labels require revision when (1) more than 30% of publications are misclassified by expert judgment, (2) alternative labels are proposed by multiple experts independently, or (3) temporal inconsistencies exist where historical and contemporary works are improperly grouped.
Purpose: To ensure observed bibliometric trends align with documented historical developments in the research domain.
Materials: VOSviewer overlay visualization, historical timeline of domain milestones, reference literature reviews.
Methodology:
Validation Criteria: Temporal trends are validated when (1) color gradients correlate with known scientific breakthroughs (>0.7 correlation), (2) seminal publications appear at expected timeline positions, and (3) trend discontinuities align with major policy changes or funding initiatives.
Purpose: To verify that bibliometric patterns are consistent across different literature databases, reducing database-specific biases.
Materials: Parallel datasets from Scopus, Web of Science, and Dimensions; VOSviewer software; correlation analysis tools.
Methodology:
Validation Criteria: Findings are considered database-independent when (1) cluster similarity indices exceed 0.75, (2) core author networks show >60% overlap, and (3) key term relationships remain stable across data sources.
Bibliometric Validation Workflow
Cluster Validation Decision Protocol
Table: Essential Validation Tools for Bibliometric Research
| Tool/Resource | Function | Application in Validation |
|---|---|---|
| VOSviewer Software | Constructing and visualizing bibliometric networks [6] | Primary tool for generating co-occurrence, citation, and co-authorship maps requiring validation |
| Perceptually Uniform Color Schemes | Viridis, magma, and other accessible palettes [5] | Ensure visualizations are interpretable by all users, including those with color vision deficiencies |
| Domain Expert Panels | Qualitative assessment of cluster integrity and label accuracy | Provide ground truth evaluation of bibliometric patterns against domain knowledge |
| Multiple Literature Databases | Scopus, Web of Science, Dimensions, OpenAlex [6] | Cross-database verification to eliminate platform-specific biases in literature coverage |
| Historical Milestone Databases | Curated timelines of scientific breakthroughs | Temporal validation of observed research trends and emergence patterns |
| Color Vision Deficiency Simulators | Tools like Coblis, Adobe Color [45] | Testing visualization accessibility for various color blindness types (deuteranopia, protanopia, tritanopia) |
| Quantitative Metric Suites | Silhouette scores, consistency ratios, coherence indices | Numerical validation of cluster quality and pattern robustness |
The application of these validation protocols is particularly crucial in environmental science, where research trends directly influence policy and funding decisions. The bibliometric analysis of environmental degradation research exemplifies proper validation, where apparent trends like the dominance of economic growth studies were confirmed through multiple validation protocols [16]. The researchers cross-referenced VOSviewer outputs with domain knowledge, confirming that China, Pakistan, and Turkey genuinely led research output rather than this pattern resulting from database biases.
Environmental science bibliometrics must also validate emerging trends against known scientific developments. For instance, the rapid growth of renewable energy literature should align with technological advancements and policy initiatives. The validation protocols ensure that such patterns reflect genuine intellectual shifts rather than terminological changes or database artifacts. This is especially important when identifying research gaps, such as the under-exploration of AI and metaverse applications in environmental degradation research identified in the recent study [16].
Validating bibliometric findings with domain knowledge transforms VOSviewer visualizations from suggestive patterns into reliable research tools. Through structured protocols encompassing semantic, structural, temporal, and methodological validation, researchers can confidently interpret bibliometric maps and derive meaningful insights. The integration of quantitative metrics with qualitative expert assessment creates a robust framework for validation, particularly crucial in applied fields like environmental science where research trends inform real-world decisions. As bibliometric methodology evolves, continuing to strengthen these validation practices ensures the growing influence of bibliometrics remains grounded in scholarly rigor.
Bibliometric analysis has become an indispensable methodology for mapping the intellectual structure and evolutionary trends within scientific domains, particularly in environmentally-focused research where understanding global collaboration and emerging topics is crucial [46]. The evolution of bibliometric analysis from its origins in library science to its current role as a cornerstone of research evaluation reflects the broader transformation of academic research in the digital age [46]. This expansion has been facilitated by specialized software tools that enable sophisticated data visualization and analysis, with VOSviewer, Bibliometrix, and CiteSpace emerging as three prominent solutions. Each tool offers unique capabilities for performance analysis and science mapping—the two key dimensions of bibliometric analysis that focus respectively on productivity/impact and the conceptual, intellectual, and social structure of research domains [46].
Within environmental science research, where interdisciplinary collaboration and rapidly evolving research fronts are commonplace, selecting the appropriate bibliometric tool is critical for generating meaningful insights. This application note provides a systematic comparison of these three software tools, evaluating their respective strengths and weaknesses specifically for applications in environmental research contexts such as climate change adaptation [47], environmental degradation [16], ecological impacts of energy systems [48], and ESG performance [49]. By presenting structured comparisons, experimental protocols, and practical workflows, this analysis equips researchers with the knowledge to select and implement the most appropriate tool for their specific research objectives.
VOSviewer (Visualization of Similarities viewer) was developed by Nees Jan van Eck and Ludo Waltman at Leiden University's Centre for Science and Technology Studies [14]. Its technical foundations are detailed in numerous publications, with the first technical paper appearing in 2007 [14]. The software implements the VOS (Visualization of Similarities) mapping technique and a smart local moving algorithm for large-scale modularity-based community detection [14]. VOSviewer is designed specifically for constructing, visualizing, and exploring bibliometric maps based on network data from bibliographic databases.
CiteSpace, developed by Chaomei Chen, is a Java-based application specializing in visual exploratory analysis of emerging trends and transient patterns in scientific literature [46]. It employs algorithms for detecting burst terms and betweenness centrality to identify pivotal points in research networks. CiteSpace is particularly noted for its temporal analysis capabilities, enabling researchers to track the evolution of research fields over discrete time periods.
Bibliometrix is an open-source R package complemented by a web-based interface called Biblioshiny. It offers a comprehensive suite of tools for quantitative research in bibliometrics and scientometrics [46]. Unlike the other tools, Bibliometrix leverages the statistical capabilities of the R environment, allowing for advanced statistical analysis and customization through programming.
Table 1: Fundamental Characteristics of Bibliometric Software Tools
| Characteristic | VOSviewer | Bibliometrix | CiteSpace |
|---|---|---|---|
| Primary Developer | Van Eck & Waltman (Leiden University) | Massimo Aria & Corrado Cuccurullo | Chaomei Chen |
| Initial Release | 2007-2009 [14] | 2017 | 2004 |
| Programming Language | Java | R (Biblioshiny web interface) | Java |
| Software Type | Standalone desktop application | R package with web interface | Standalone desktop application |
| License Model | Freeware | Open-source (R package) | Freeware for non-commercial use |
| Data Integration | Supports multiple data formats | Extensive R integration | Java-based framework |
| System Requirements | Java Runtime Environment | R environment | Java Runtime Environment |
Each tool offers distinct analytical capabilities that determine its suitability for specific research questions in environmental science. VOSviewer excels in creating clear, intuitive visualizations of co-occurrence networks, with special optimization for keyword co-occurrence analysis and citation networks [14] [50]. Its visualization approach emphasizes the clarity of network maps through the VOS clustering technique, which is particularly valuable for identifying major research themes in environmental domains such as "economic growth, renewable energy, and the Environmental Kuznets Curve" [16]. The software's accessibility and responsive interface make it suitable for researchers without programming expertise [16].
Bibliometrix provides the most comprehensive statistical analysis capabilities among the three tools, leveraging the full power of the R environment [46]. It supports the entire bibliometric analysis workflow from data import to visualization, with particular strength in performance analysis including author productivity, source impact, and country-level contributions. This makes it valuable for environmental studies requiring detailed statistical assessment of research output, such as analyzing global contributions to climate change adaptation research [47].
CiteSpace specializes in temporal analysis of research frontiers and emerging trends. Its unique strength lies in detecting burst terms and visualizing the evolution of research fields through time-sliced networks [46]. This capability is particularly useful for tracking the development of fast-evolving environmental topics like "wind-PHS coupling and life-cycle assessment" in energy storage research [48]. CiteSpace also provides advanced metrics like betweenness centrality for identifying pivotal points in research networks.
Table 2: Comparative Analysis of Core Functionalities
| Functionality | VOSviewer | Bibliometrix | CiteSpace |
|---|---|---|---|
| Network Types | Co-authorship, co-citation, co-occurrence, bibliographic coupling [14] | Comprehensive including coupling, co-citation, collaboration networks | Co-citation, collaboration, co-occurrence, thematic evolution |
| Visualization Features | Cluster-based maps, density views, overlay visualizations [14] | Various plot types including thematic maps, factorial analysis | Time-sliced networks, burst detection, betweenness centrality |
| Data Compatibility | Scopus, WoS, PubMed, RIS | Scopus, WoS, Dimensions, PubMed | WoS, Scopus, Dimensions, PubMed |
| Learning Curve | Gentle | Moderate to steep (depending on R proficiency) | Steep |
| Customization Options | Moderate through GUI | High through R programming | Moderate through GUI |
| Collaboration Analysis | Author, organization, country | Comprehensive collaboration networks | Author, institution, country |
| Thematic Evolution | Limited | Thematic evolution, factorial analysis | Specialized timeline and timezone views |
A standardized data collection protocol is essential for rigorous bibliometric analysis across all three tools. For environmental research applications, the following protocol ensures comprehensive data retrieval:
Database Selection: Identify primary data sources—typically Scopus and Web of Science (WoS)—based on coverage of environmental literature [16] [47]. Scopus often provides broader coverage of environmental journals, while WoS offers more selective indexing.
Search Query Development: Formulate targeted search strings using Boolean operators. For example:
Time Frame Specification: Define appropriate temporal boundaries based on research objectives. Many environmental studies cover decades to capture evolution of the field [16], while others may focus on specific periods of high activity (e.g., 2014-2024 for climate adaptation research [47]).
Data Export: Export full bibliographic records in the appropriate format for each tool:
Data Cleaning: Implement standardization procedures for author names, affiliations, and keywords using the preprocessing capabilities of each tool or external scripting.
Figure 1: Bibliometric Data Collection and Analysis Workflow
For analyzing environmental research networks in VOSviewer:
Data Import: Use "Create" function to build maps from bibliographic data, selecting the appropriate map type (co-authorship, co-occurrence, citation, or bibliographic coupling) [14].
Co-occurrence Analysis: For identifying research themes in environmental science:
Network Visualization and Interpretation:
Citation Analysis: Employ citation-based networks to identify foundational papers and emerging highly-cited works in environmental research.
For environmental research evaluation using Bibliometrix:
Data Loading: Use the Biblioshiny web interface or R commands to import and convert bibliographic data.
Performance Analysis:
Science Mapping:
Statistical Reporting: Generate comprehensive summary statistics describing the literature dataset.
For analyzing evolution of environmental research fronts using CiteSpace:
Project Setup: Configure time slicing parameters to divide the dataset into sequential periods (typically 1-3 year slices).
Burst Detection: Apply Kleinberg's algorithm to identify suddenly popular topics (e.g., emerging environmental concepts like "ESG performance" or "pumped hydro storage") [49] [48].
Betweenness Centrality Calculation: Identify pivotal papers that connect different research clusters in environmental science.
Timeline and Timezone Visualization: Generate temporal views showing the emergence, evolution, and decline of research themes.
A recent bibliometric analysis of environmental degradation research exemplifies VOSviewer's application, analyzing 1365 papers to identify key trends and patterns [16]. The analysis revealed:
This study demonstrated VOSviewer's strength in creating intuitive visualizations that "provide a strategic roadmap for future research" in environmental science [16].
Research on institutional dynamics in climate change adaptation employed bibliometric analysis to identify research patterns and geographical distributions [47]. The study revealed:
Figure 2: Tool Selection Guide for Environmental Research Questions
Table 3: Tool Performance in Specific Environmental Research Applications
| Research Application | VOSviewer | Bibliometrix | CiteSpace |
|---|---|---|---|
| Climate Change Adaptation | Effective for mapping research themes | Superior for analyzing geographical contributions and collaboration | Strong for tracking evolution of adaptation strategies |
| Environmental Degradation | Excellent for determinant identification [16] | Comprehensive for statistical trends | Effective for detecting emerging determinants |
| Energy Storage Research | Good for technology relationship mapping | Strong for publication output analysis | Superior for tracking technology evolution [48] |
| ESG Performance Studies | Effective for conceptual structure | Comprehensive for interdisciplinary analysis | Strong for identifying emerging ESG topics [49] |
| Pollution and Carbon Emissions | Optimal for co-occurrence network visualization | Excellent for temporal production analysis | Effective for identifying research fronts |
Table 4: Essential Research Reagents for Bibliometric Analysis in Environmental Science
| Research Reagent | Function | Example Sources/Tools |
|---|---|---|
| Bibliographic Databases | Source data for analysis | Scopus, Web of Science, Dimensions |
| Data Extraction Tools | Export and format bibliographic data | Scopus Export, WOS Export Utilities |
| Reference Managers | Organize and preprocess references | Zotero, Mendeley, EndNote |
| Statistical Software | Complementary statistical analysis | R, Python, SPSS |
| Text Mining Tools | Enhance keyword processing | Natural Language Processing libraries |
| Visualization Platforms | Supplementary visualization | Gephi, Tableau, Microsoft Power BI |
For researchers requiring comprehensive analysis, an integrated workflow leveraging multiple tools provides the most robust approach:
Data Collection and Preparation: Use Bibliometrix for initial data importing and cleaning due to its flexible data handling capabilities.
Performance Analysis: Employ Bibliometrix for comprehensive productivity and impact assessment of countries, institutions, authors, and journals.
Science Mapping: Utilize VOSviewer for clear, interpretable network visualizations of research themes and intellectual structure.
Temporal Analysis: Implement CiteSpace for detecting emerging trends and visualizing the evolution of research fronts.
Results Integration: Synthesize findings from all tools to develop a complete picture of the research landscape.
This integrated approach compensates for the limitations of individual tools while leveraging their respective strengths, ultimately producing more rigorous and insightful bibliometric assessment of environmental research domains.
VOSviewer, Bibliometrix, and CiteSpace each offer unique value propositions for bibliometric analysis in environmental science research. VOSviewer excels in creating accessible, interpretable network visualizations with particular strength in co-occurrence analysis. Bibliometrix provides the most comprehensive statistical toolkit with seamless integration into the R ecosystem. CiteSpace offers unparalleled capabilities for temporal analysis and detection of emerging research fronts.
Tool selection should be guided by research objectives: VOSviewer for intuitive visualization and clustering analysis, Bibliometrix for comprehensive performance assessment and statistical analysis, and CiteSpace for investigating temporal patterns and emerging trends. For complex environmental research questions, an integrated approach leveraging the complementary strengths of all three tools often yields the most robust and actionable insights for researchers, policymakers, and environmental professionals seeking to understand the evolving landscape of sustainability science.
In the field of environmental science research, bibliometric analysis has become an indispensable technique for mapping the intellectual structure and evolution of scholarly fields. The reliability and comprehensiveness of such analyses are fundamentally dependent on the quality and scope of the underlying bibliographic data. Proprietary databases like Scopus and emerging open sources like OpenAlex each present unique advantages and limitations in coverage, particularly across different geographic and disciplinary domains [51]. Cross-verification using multiple data sources mitigates the inherent biases of any single database, ensuring a more robust and reproducible analysis. This protocol provides a detailed framework for integrating and validating data from Scopus, Dimensions, and OpenAlex specifically for bibliometric studies in environmental science using VOSviewer software.
Table 1: Essential Materials and Software for Bibliometric Cross-Verification
| Item Name | Function/Application | Specification Notes |
|---|---|---|
| Scopus API | Retrieving structured bibliographic data from Elsevier's Scopus database. | Requires institutional subscription; offers comprehensive coverage of peer-reviewed literature [52]. |
| OpenAlex API | Accessing open scholarly metadata; a continuation of Microsoft Academic Graph [51]. | Freely available; permits reproducible bibliometrics without licensing barriers [52] [51]. |
| VOSviewer | Creating and visualizing bibliometric maps based on network data. | Specialized functions for collaboration, topic, and citation analysis [12]. |
| Bibliographic Data Parser | Cleaning and standardizing records from different sources (e.g., authors, institutions). | Custom scripts in Python, R, or SQL are often necessary for data harmonization [52]. |
| Reference Matching Script | Identifying and linking duplicate publications across databases. | Crucial for creating a unified, non-redundant dataset for analysis. |
A critical first step in cross-verification is understanding the core characteristics and comparative performance of the available databases. The following table summarizes key metrics based on recent large-scale studies.
Table 2: Data Source Characteristics for Bibliometric Analysis in Environmental Science
| Characteristic | Scopus | OpenAlex | Dimensions |
|---|---|---|---|
| Provider & Business Model | Elsevier (Proprietary) | OurResearch (Open Access) | Digital Science (Proprietary) |
| Coverage (General) | Extensive, selective | Very extensive, inclusive | Extensive |
| Internal Reference Coverage | High | Comparable to Scopus and WoS on shared publications [51] | Information Missing |
| Metadata Completeness | High | Mixed (e.g., more ORCIDs, fewer abstracts) [51] | Information Missing |
| Primary Application | Disciplined, traditional bibliometrics | Reproducible, large-scale bibliometrics [51] | Information Missing |
| Key Strength | Well-curated metadata | Permissive licensing for open research | Information Missing |
| Key Limitation | Licensing cost and restrictions | Rapidly evolving and changing data [51] | Information Missing |
Recent analyses indicate that when restricted to a cleaned dataset of recent publications shared across Scopus, Web of Science, and OpenAlex, OpenAlex demonstrates average source reference numbers and internal coverage rates that are comparable to both Web of Science and Scopus [51]. This makes it a viable and powerful open-source alternative for many bibliometric applications. However, its metadata coverage is mixed, capturing more ORCID identifiers but fewer abstracts than its proprietary counterparts [51].
This protocol outlines the steps for gathering data from multiple APIs and standardizing it for analysis.
I. Research Question Formulation Define clear research questions to guide search strategy, data collection, and analysis. For example: "What is the thematic evolution of climate change adaptation research from 2015 to 2025?"
II. Search Strategy Development
III. Data Retrieval via APIs
IV. Data Cleaning and Harmonization
V. Data Integration and De-duplication
VI. Final Dataset Creation
Data Retrieval and Harmonization Workflow
This protocol describes methods to validate the coverage and consistency of the retrieved dataset.
I. Coverage Analysis
II. Benchmarking with a Gold Standard
III. Consistency Checks
This protocol details the creation of bibliometric maps using the cross-verified dataset, incorporating best practices for threshold selection.
I. Data Preparation and Import
II. Threshold Setting
III. Map Creation and Interpretation
VOSviewer Analysis with Thresholding
The integration and cross-verification of data from Scopus, Dimensions, and OpenAlex establishes a rigorous foundation for bibliometric analysis in environmental science. This multi-source approach leverages the respective strengths of each database—be it the curated metadata of Scopus or the open and inclusive nature of OpenAlex—while mitigating their individual biases. By adhering to the detailed protocols for data retrieval, harmonization, and validation outlined above, researchers can utilize VOSviewer to generate more accurate, reliable, and comprehensive maps of scientific knowledge, thereby enhancing the integrity and reproducibility of their research outcomes.
In the rapidly evolving field of environmental science, where research directly informs critical policy and conservation decisions, accurately assessing scholarly impact has never been more important. Traditional citation counts, while valuable for measuring academic influence, provide an incomplete picture of a study's true reach and significance. This limitation is particularly pronounced in environmental science, where research often influences policy, public awareness, and industrial practices beyond academic circles. The integration of bibliometric analysis using visualization tools like VOSviewer represents a paradigm shift in research assessment, enabling scholars to identify emerging trends, map intellectual networks, and contextualize citation metrics within broader scientific landscapes. This methodological approach allows researchers to transition from simply counting citations to understanding the complex relationships between ideas, authors, and institutions that drive scientific progress in environmental domains.
The evolution of bibliometric methodology from basic citation counting to sophisticated network analysis reflects a growing recognition that scientific impact is multidimensional. Modern bibliometric tools like VOSviewer, developed by researchers at Leiden University's Centre for Science and Technology Studies [5], enable both performance analysis and science mapping, providing insights into the structural and dynamic aspects of scientific research. For environmental scientists, this means being able to track the development of key concepts such as "blue economy," "environmental degradation," or "sustainable financial inclusion" across time and geographic boundaries, identifying not just what is being cited, but how ideas cluster and evolve in response to global environmental challenges.
The assessment of research impact has undergone significant transformation since the early days of simple citation counting. Bibliometrics, a term first introduced by Otlet in the 1930s and popularized by Pritchard in 1969 [53], has evolved from basic publication counts to sophisticated analyses of scientific networks and knowledge structures. This evolution mirrors the increasing complexity of environmental research itself, which requires interdisciplinary approaches to address multifaceted challenges like climate change, biodiversity loss, and sustainable development.
Traditional citation analysis, while useful for measuring academic influence, suffers from several limitations in environmental science contexts. It often favors established topics over emerging ones, overlooks non-academic impact, and fails to capture the relational aspects of knowledge production. The integration of tools like VOSviewer has addressed these gaps by enabling:
These advanced techniques allow environmental scientists to visualize the intricate knowledge networks that underlie scientific progress, moving beyond simple quantitative measures to qualitative understanding of how research ideas connect and evolve.
While traditional citations remain important, several complementary metrics have emerged to provide a more nuanced understanding of research impact:
Table: Emerging Research Impact Metrics in Environmental Science
| Metric Category | Specific Examples | Application in Environmental Science |
|---|---|---|
| Citation Enhancements | Field-Weighted Citation Impact, Citation Percentiles | Contextualizes citation performance within specific subfields like climate science or conservation biology |
| Alternative Metrics | Altmetric Attention Score, Social Media Mentions | Captures policy uptake, public engagement, and media coverage of environmental research findings |
| Network Metrics | Betweenness Centrality, Modularity Class | Identifies bridging studies that connect different research communities or thematic clusters |
| Temporal Metrics | Burst Detection, Half-Life | Pinpoints rapidly emerging topics and sustainability of research influence over time |
The Altmetric Attention Score (AAS) has particular relevance for environmental science, where research often informs public policy and conservation practice. This metric quantifies attention across news outlets, social media, policy documents, and other non-academic sources, capturing impact that traditional citations might miss [54]. For example, a study on plastic pollution might receive modest citation counts but generate significant policy discussions and public awareness, reflected in its AAS.
VOSviewer (Visualization of Similarities viewer) is a specialized software tool for constructing and visualizing bibliometric networks that has become increasingly prominent in environmental science research. Developed by van Eck and Waltman at Leiden University [12] [5], this Java-based application provides user-friendly functionality for analyzing bibliometric data through multiple visualization techniques:
The software supports several types of bibliometric analysis particularly relevant to environmental research, including co-authorship (between researchers, organizations, countries), co-citation (of references, authors, journals), and co-occurrence (of keywords, terms) [12]. Recent versions have introduced improved color schemes such as "viridis" and "tab20" to replace the problematic rainbow palette, enhancing perceptual uniformity and accessibility for color-blind users [5].
VOSviewer operates within a broader ecosystem of bibliometric tools and typically follows data extraction from major databases like Scopus and Web of Science. The software imports data in RIS or CSV formats and can process thousands of records simultaneously, making it suitable for comprehensive analyses of environmental research domains. Its compatibility with other tools like CiteSpace [54] and Bibliometrix [55] allows researchers to combine multiple analytical approaches, validating findings through methodological triangulation.
For environmental scientists, VOSviewer's ability to handle large datasets is particularly valuable given the interdisciplinary and collaborative nature of the field. Studies analyzing trends in sustainable development [11], environmental degradation [16], or climate change research typically involve thousands of publications across multiple subdisciplines, requiring robust software capable of mapping complex knowledge structures without sacrificing analytical nuance.
Table: Data Collection Protocol for Environmental Science Bibliometrics
| Step | Procedure | Tools | Quality Control |
|---|---|---|---|
| Database Selection | Select Scopus, Web of Science, or both based on coverage needs | Scopus, WoS | Compare initial results to assess database-specific biases |
| Search Strategy | Develop comprehensive search strings using Boolean operators | Database interfaces | Validate search sensitivity and specificity with test sets |
| Time Frame | Define appropriate temporal range (e.g., 2000-2025) | - | Justify time period based on research questions |
| Export Parameters | Export full record and cited references | RIS, Plain text | Verify complete metadata extraction |
| Data Cleaning | Remove duplicates, standardize terms, complete metadata | OpenRefine, Excel, R | Implement systematic deduplication protocol |
Protocol Details:
Database Selection: Choose between Scopus and Web of Science based on disciplinary coverage. Scopus generally provides broader coverage of environmental journals, while WoS offers more consistent citation data. For comprehensive analyses, use both databases and merge results after deduplication [56] [55].
Search Strategy Development: Formulate structured search queries using title-abstract-keyword fields. For example, in sustainable energy research: TITLE-ABS-KEY ("renewable energy" AND "policy" AND "developing countries"). Test search sensitivity by verifying known key papers are included [16].
Time Frame Determination: Select appropriate time frames based on research questions. For emerging trends, recent 5-10 year periods may suffice; for evolutionary analysis, longer timeframes (20+ years) are necessary [11].
Data Export: Export complete bibliographic records including citations, references, abstracts, and keywords. For VOSviewer analysis, the "full record and cited references" export option is recommended [54].
Data Cleaning: Implement systematic cleaning procedures:
Network Construction Steps:
Data Import: Load the processed data into VOSviewer using the "Create" function based on bibliographic database files.
Analysis Type Selection: Choose appropriate analysis type:
Threshold Setting: Apply frequency thresholds to focus on most relevant items. For environmental science reviews, typical initial thresholds might be:
Mapping Parameters:
Visualization Refinement:
Bibliometric Analysis Workflow
Cluster Analysis: Identify major thematic clusters based on network modularity. Label clusters by examining central terms and highly cited documents within each group.
Temporal Analysis: Use overlay visualization to track concept evolution. Color code by average publication year to identify emerging (recent) versus established (older) topics.
Network Metrics Interpretation:
Validation Procedures:
Bibliometric analysis using VOSviewer has revealed significant insights into the evolution of sustainable development research. A comprehensive analysis of Sustainable Inclusive Economic Growth (SIEG) within the SDG 8 framework documented a substantial increase in research output post-2015, with a notable surge after 2019 as global efforts toward the UN 2030 Agenda intensified [11]. The analysis identified China, India, and Italy as the most productive countries, while "Sustainability (Switzerland)" ranked as the leading journal in this domain.
Thematic evolution analysis revealed a distinct shift from earlier focus areas like financial inclusion and corporate social responsibility (2014-2023) toward emerging topics like digital economy, blue economy, employment, and entrepreneurship (2024-2025) [11]. This temporal mapping provides valuable intelligence for researchers and policymakers seeking to align investigations with evolving priorities in sustainability science.
In environmental degradation research, VOSviewer analysis of 1,365 papers revealed an astonishing annual publication growth rate exceeding 80%, reflecting intensified global focus on sustainability challenges [16]. The analysis identified economic growth as the most frequently studied factor connected to environmental degradation, with particular emphasis on themes like renewable energy and the Environmental Kuznets Curve.
Network visualization demonstrated how energy consumption, globalization, and urbanization drive carbon emissions research, with China, Pakistan, and Turkey leading research output. The bibliometric approach helped identify emerging research hotspots, including the role of advanced technologies like artificial intelligence and the Metaverse, as well as behavioral and psychological factors influencing environmental degradation [16].
A bibliometric analysis of sustainable financial inclusion research revealed eight distinct thematic clusters, including digital finance, ESG integration, green finance, and financial literacy, demonstrating the multidimensional nature of this evolving field [56]. The analysis documented rapid growth since 2017, led by China, India, and the United States, while also revealing geographic imbalances and underrepresentation of Sub-Saharan Africa and Central Asia regions.
The VOSviewer mapping identified major barriers to sustainable financial inclusion, including financial illiteracy and uncoordinated regulations among institutions, providing actionable intelligence for policymakers seeking to align inclusive finance with Sustainable Development Goals [56].
Table: Essential Bibliometric Tools for Environmental Research Analysis
| Tool Name | Primary Function | Application in Environmental Science | Access |
|---|---|---|---|
| VOSviewer | Network visualization and clustering | Mapping thematic evolution in environmental research domains | Free download |
| Bibliometrix (R-tool) | Comprehensive bibliometric analysis | Performance analysis of countries, institutions, authors | R package |
| CiteSpace | Burst detection and temporal analysis | Identifying emerging trends and paradigm shifts | Free download |
| SCImago Graphica | Geographic mapping of research output | Visualizing regional contributions to environmental research | Free download |
| Google Scholar | Broad literature search | Complementary coverage beyond subscription databases | Web access |
Implementation Notes:
The effective application of these tools requires thoughtful integration into the research workflow. VOSviewer excels at visualization and cluster identification, while Bibliometrix provides more robust performance analysis capabilities. For environmental scientists studying rapidly evolving fields like climate change adaptation or plastic pollution research, CiteSpace's burst detection functionality can identify suddenly popular topics that might represent research fronts.
Many research groups employ a sequential approach where Bibliometrix is used for initial data screening and performance analysis, followed by VOSviewer for network construction and visualization, with CiteSpace adding specialized temporal analysis for trend identification. This tool combination provides methodological triangulation, strengthening the validity of bibliometric findings.
Effective interpretation of VOSviewer maps requires understanding key visualization principles:
The software offers multiple visualization modes suited to different analytical questions in environmental research. Density visualization helps identify knowledge concentrations, overlay visualization reveals temporal trends, and network visualization displays relational structures between research constituents.
Network Map Interpretation Guide
Beyond basic network interpretation, several advanced techniques enhance the value of bibliometric analysis for environmental research:
Overlay Visualization for Trend Analysis: Using the color gradient from blue (older) to yellow (newer) in the viridis color scheme to identify emerging topics at the research frontier [5].
Burst Detection: Identifying concepts with sudden increases in frequency that may indicate emerging research fronts or responding to environmental crises.
Geographical Mapping: Integrating bibliometric findings with geographic visualization to reveal regional specialization and international collaboration patterns in environmental research.
Multilevel Analysis: Conducting simultaneous analysis at multiple levels (authors, institutions, countries) to understand scale-dependent patterns in knowledge production.
For example, in analyzing climate change adaptation research, overlay visualization might reveal shifting emphasis from general vulnerability assessment to specific resilience strategies, while geographical mapping could identify leading regions and potential collaboration opportunities for knowledge transfer.
Bibliometric analysis has evolved into a crucial methodology for quantitatively assessing scholarly literature, enabling the systematic mapping of research landscapes across scientific domains. This approach utilizes quantitative data analysis and network visualization to identify emerging trends, intellectual structures, and collaborative patterns within scientific literature [57]. In environmental science, where research questions are complex and funding resources are competitive, bibliometrics provides evidence-based insights that help shape research agendas and strategically allocate funding resources. The integration of specialized software tools like VOSviewer has significantly enhanced our capacity to process and visualize large bibliometric datasets, revealing patterns that might otherwise remain obscured in conventional literature reviews [58]. This application note examines the methodological protocols and practical applications of bibliometric analysis in guiding research priorities within environmental science, with specific focus on VOSviewer implementation.
Bibliometric analyses have revealed several pivotal research trends and shifts in environmental science. Studies utilizing VOSviewer have demonstrated a substantial rise in research focusing on ecological product valuation and ecosystem services, particularly following international policy frameworks such as the United Nations Sustainable Development Goals [36]. The analysis of publication trends has enabled researchers to track the evolution from initial conceptual exploration to global cooperation and policy application phases in ecosystem service-based ecological risk assessment (ESRA) [58].
Research in environmental degradation has shown an accelerating publication growth rate exceeding 80% annually, with particular emphasis on themes like economic growth, renewable energy, and the Environmental Kuznets Curve [16]. The analysis of 1,365 research papers in this domain revealed that economic growth remains the most extensively studied factor, with China, Pakistan, and Turkey emerging as leading contributors to the research output [16].
In the microplastics research domain, bibliometrics has uncovered an explosive growth in publications, particularly from 2014 to 2023, with research expanding from marine environments to terrestrial and atmospheric systems [4]. This analysis has helped identify four major research clusters: distribution and sources, toxic effects, analytical methods, and interactions with other pollutants [4].
Table 1: Key Research Trends in Environmental Science Identified Through Bibliometric Analysis
| Research Domain | Primary Trends Identified | Temporal Pattern | Leading Contributing Countries |
|---|---|---|---|
| Ecological Product Value | Ecosystem services valuation, Policy frameworks | Two-phase growth: starting/exploring (1993-2010) and rapid development (2011-2023) | China, United States, European nations [36] |
| Environmental Degradation | Economic growth, Renewable energy, Environmental Kuznets Curve | Annual growth >80%, particularly accelerated since 2015 | China, Pakistan, Turkey [16] |
| Microplastic Pollution | Distribution pathways, Toxicological effects, Analytical methods | Explosive growth since 2014, 3,548 publications in 2022 alone | China, USA, UK, Australia, Canada [4] |
| Ecosystem Service Risk Assessment | Landscape ERA, Aquatic ecosystems, Ecosystem health | Four-stage evolution: initial development (1994-2005) to global cooperation | Not specified [58] |
Bibliometric analysis directly shapes research agendas by identifying knowledge gaps and emerging frontiers in scientific literature. The visual mapping of keyword co-occurrence and evolution over time allows researchers to detect shifting priorities and underexplored areas requiring investigation [36]. For instance, in ecological product value research, bibliometric analysis has highlighted the need for more comprehensive value development, improved value realization pathways, and refined accounting methodologies [36].
The analysis of collaboration networks has revealed substantial international cooperation patterns, with countries like China, the United States, and the United Kingdom forming central hubs in microplastics research networks [4]. These insights help funding agencies promote strategic international partnerships and allocate resources to regions where research capacity building is most needed.
Citation analysis has further enabled the identification of seminal works and conceptual foundations within environmental research domains. For example, in process safety and environmental protection, bibliometric mapping revealed influential publications and research trends that have shaped the field's development over three decades [57]. This helps new researchers quickly grasp the intellectual structure of the field and identify foundational knowledge.
Funding agencies increasingly utilize bibliometric analysis to inform strategic prioritization and resource allocation. The quantitative assessment of publication outputs, citation impacts, and collaboration networks provides objective criteria for evaluating research productivity and impact [57]. Bibliometric indicators have become valuable tools for assessing the return on investment in research funding and identifying promising areas for future investment.
The analysis of research fronts and emerging topics allows funding agencies to support cutting-edge investigations in areas such as the role of advanced technologies like artificial intelligence and the Metaverse in environmental science, as well as behavioral and psychological factors influencing environmental degradation [16]. These analyses help anticipate future research directions rather than merely responding to past trends.
Bibliometric mapping has also supported the identification of interdisciplinary opportunities where environmental science converges with other domains. This enables funding agencies to promote cross-disciplinary initiatives that address complex environmental challenges through integrated approaches [58].
Objective: To systematically collect and preprocess bibliographic data from authoritative databases for analysis in VOSviewer.
Materials and Reagents:
Procedure:
Troubleshooting Tips:
Objective: To analyze and visualize bibliometric networks using VOSviewer software.
Materials and Reagents:
Procedure:
Troubleshooting Tips:
Objective: To identify evolutionary trends and forecast future research directions.
Materials and Reagents:
Procedure:
Troubleshooting Tips:
Table 2: Essential Research Reagents and Tools for Bibliometric Analysis
| Tool/Resource | Function | Application in Environmental Science |
|---|---|---|
| VOSviewer Software | Network visualization and analysis | Creating co-authorship, co-occurrence, and citation networks; identifying research trends [57] |
| Web of Science Core Collection | Primary bibliographic database | Comprehensive coverage of high-impact environmental science literature [36] |
| Scopus Database | Complementary bibliographic database | Expanded journal coverage for more comprehensive analysis [16] |
| CiteSpace Software | Complementary analysis tool | Temporal pattern analysis and burst detection [58] |
| SciMAT Software | Science mapping analysis | Thematic evolution analysis and strategic diagram creation [58] |
| Boolean Search Operators | Query formulation | Precise literature retrieval using logical combinations [36] |
| Viridis Color Scheme | Visualization enhancement | Perceptually uniform coloring for trend visualization [5] |
Bibliometric analysis, particularly when implemented through specialized tools like VOSviewer, provides robust methodological frameworks for shaping research agendas and funding priorities in environmental science. The protocols outlined in this document offer standardized approaches for data collection, analysis, and interpretation that enable evidence-based decision-making in research planning and resource allocation. As environmental challenges continue to evolve in complexity, bibliometric methods will play an increasingly vital role in identifying emerging research fronts, fostering strategic collaborations, and ensuring that funding priorities align with the most pressing scientific and societal needs. The integration of these quantitative approaches with domain expertise represents a powerful paradigm for advancing environmental science in the coming decades.
VOSviewer emerges as an indispensable tool for navigating the complex and rapidly evolving landscape of environmental science research. By mastering its capabilities for foundational exploration, methodological application, troubleshooting, and validation, researchers can systematically decode research trends, identify influential works and collaborations, and pinpoint emerging frontiers—from resilient cities to microplastic pollution. This synthesis of visual bibliometrics and domain expertise not only enriches literature reviews but also actively shapes future research directions and resource allocation. As environmental challenges grow in complexity, the ability to conduct robust, data-driven analyses of scientific literature will be crucial for accelerating discovery and innovation. Future advancements in VOSviewer and integrated bibliometric methods promise even greater precision in tracking the development and impact of environmental research, ultimately contributing to more informed and effective scientific progress.