About & Methodology
Overview
UTP Publication Lens is an internal bibliometric tool that consolidates, cleans, and visualises Universiti Teknologi PETRONAS’ (UTP) research output. Data are sourced from two major indexing platforms — Scopus (via SciVal) and Web of Science (via InCites & JCR) — and enriched with CiteScore, MyCite, and a list of Scopus-discontinued journals. The dashboard supports internal reporting, benchmarking against world averages, and strategic research planning.
Data Sources
Raw export from Elsevier Scopus. Bibliographic metadata per publication.
SciVal export matched 1-to-1 with Scopus by EID. Primary analytics source.
Elsevier CiteScore 2024 rankings. Assigns Q1–Q4 quartile per publication by source title match.
Clarivate WoS Core Collection export. Citation data and Open Access designations.
Clarivate InCites export matched 1-to-1 with WoS. Normalised impact metrics.
Clarivate JCR 2024. Assigns JIF value and Q1–Q4 quartile to WoS publications via ISSN/eISSN.
Journals removed from Scopus indexing. Publications matched here are flagged is_discontinued = True.
Malaysian Citation Centre (MCC) accredited journal list. Matched publications flagged is_malaysian = True.
Data Processing Pipeline
Scopus Pipeline
scopus.csv and scival.csv loaded; EID normalised (lowercase, stripped whitespace).EID (left join). SciVal contributes FWCI, topics, collaboration type, and subject classifications.scopus_active_sources.csv master map plus keyword overrides (Elsevier, Springer Nature, MDPI, Wiley, IEEE, Taylor & Francis, etc.).citescore.csv; _quartile column (Q1–Q4 / No-Q) added per publication.discontinued.csv (is_discontinued) and mycite.csv (is_malaysian).Web of Science Pipeline
wos.csv (95 cols) loaded; incites.csv shares the same 6,653 rows, keyed on Accession Number.Times Cited, WoS Core provides raw counts for C/P and % cited. InCites CNCI/JNCI give normalised impact.jif.csv; JIF 2024 value and JIF Quartile (Q1–Q4) assigned per publication.Open Access Designations value treated as OA.UTP Affiliation Detection
UTP authors are identified from the Author full names and Authors with affiliations
columns in scopus.csv. Each semicolon-delimited author entry is checked for the keywords
“Universiti Teknologi Petronas” or “PETRONAS” in the affiliation string.
A regex extracts the numeric Scopus Author ID from the name field; the count of unique IDs becomes the
UTP Authors metric.
This heuristic may under-count authors with abbreviated or missing affiliation strings, and may include visiting or dual-affiliation researchers who have UTP listed as a secondary affiliation.
Metric Definitions
Scopus / SciVal
| FWCI | Field-Weighted Citation Impact. Actual citations ÷ expected citations for same type, year, and subject. Benchmark = 1.00. |
| C/P | Total citations ÷ total publications. |
| C/CP | Total citations ÷ number of cited publications only. |
| CiteScore Quartile | Q1–Q4 assigned per publication via source title match to CiteScore 2024. Q1 = top 25% in subject. |
| UTP Authors | Unique Scopus Author IDs with a UTP/PETRONAS affiliation string across all publications. |
| CAGR | Compound Annual Growth Rate of publication count, first to last year. |
WoS / InCites / JCR
| CNCI | Category Normalised Citation Impact. Normalised by document type, year, and research area. Benchmark = 1.00. |
| JNCI | Journal Normalised Citation Impact. Normalised relative to the journal’s expected citation rate. |
| Top 1% / 10% | Papers ranking in the top global percentile by subject area (via InCites Percentile in Subject Area). |
| Avg JIF | Mean Journal Impact Factor (JCR 2024) for publications matched to a JIF-ranked journal via ISSN. |
| JIF Quartile | Q1–Q4 per publication via ISSN/eISSN lookup in JCR 2024. Q1 = top 25%. |
| Open Access | Any publication with a non-empty Open Access Designations value in the WoS export. |
Limitations & Caveats
- Scopus and WoS publication counts differ due to separate indexing scope — they are not directly comparable.
- Author disambiguation relies on Scopus Author IDs; ID merges or name variants can cause under/over-counting.
- Publisher standardisation uses a keyword map; unrecognised variants appear as separate publishers.
- CiteScore quartile uses fuzzy title matching; journals with special characters or variant titles may be unmatched.
- JIF assignment uses ISSN/eISSN lookup; publications in journals absent from JCR 2024 receive no JIF.
- CNCI/JNCI values reflect the InCites export snapshot and change as citations are updated on the platform.
- Discontinued and MyCite flags require valid 8-digit ISSNs; malformed or missing ISSNs will not be flagged.
- All figures are for internal analysis only — users should verify critical numbers against primary platform dashboards.