– Engineered an ETL pipeline integrating heterogeneous datasets (2,345 university records, 7,985 economic indicators) through custom transformation workflows, using regex-based string normalization and multi-stage joins designed to retain records across both university-level and country-level indicators.
– Cleaned the data using fuzzy string matching (similarity threshold 0.7), manual country-name standardization with custom mapping dictionaries, column-wise median imputation for missing values, and dedicated parsing functions for string-formatted metrics (e.g., splitting “48 : 52” into female_ratio and male_ratio).
– Built interactive, multi-layered Tableau dashboards, including hierarchical displays and bubble charts that encode GDP magnitude as bubble size, revealing significant correlations between economic development levels and university ranking performance.
– Ran statistical analyses with Python libraries (NumPy, Matplotlib, Seaborn, SciPy): calculated Pearson correlation coefficients, applied log-scale transformations, and fit regression models, reporting R² to quantify relationships between institutional resources and performance metrics.
– Conducted multivariate analysis across 10 research questions, stratifying universities by categorical variables (political stability: Low < 40, Medium 40-69.99, High ≥ 70; gender balance: “Balanced” when the difference is < 10%) to evaluate the factors affecting university competitiveness, with statistically significant findings.
– Designed a visualization suite combining several chart types: heatmaps of research performance by country, distribution plots comparing student-staff ratios, scatter plots with trend lines, and box plots showing variance across categorical groupings.
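
The cleaning steps listed above (ratio parsing, fuzzy country matching at a 0.7 threshold, median imputation) could look roughly like the sketch below. It is not the project's actual code: it uses only the Python standard library, and the names `parse_ratio`, `standardize_country`, `impute_median`, and the `COUNTRY_OVERRIDES` entries are hypothetical stand-ins for the custom functions and mapping dictionaries the bullets mention.

```python
import re
from difflib import get_close_matches
from statistics import median

# Matches strings such as "48 : 52" (optional spaces around the colon).
RATIO_RE = re.compile(r"^\s*(\d+)\s*:\s*(\d+)\s*$")

# Hypothetical manual overrides; the real mapping dictionary is not shown in the post.
COUNTRY_OVERRIDES = {"USA": "United States", "UK": "United Kingdom"}


def parse_ratio(text):
    """Split a 'female : male' string into (female_ratio, male_ratio)."""
    match = RATIO_RE.match(text or "")
    if match is None:
        return None, None  # unparseable input is treated as missing
    return int(match.group(1)), int(match.group(2))


def standardize_country(name, known_names, cutoff=0.7):
    """Map a raw country name onto a known name, or None if nothing is close enough."""
    name = name.strip()
    if name in COUNTRY_OVERRIDES:  # manual mapping takes priority
        return COUNTRY_OVERRIDES[name]
    if name in known_names:  # exact match needs no fuzzy lookup
        return name
    # Fuzzy fallback: best candidate whose similarity ratio is at least `cutoff`.
    candidates = get_close_matches(name, known_names, n=1, cutoff=cutoff)
    return candidates[0] if candidates else None


def impute_median(values):
    """Replace None entries in one column with the median of the observed values."""
    observed = [v for v in values if v is not None]
    if not observed:
        return list(values)  # nothing to impute from
    fill = median(observed)
    return [fill if v is None else v for v in values]
```

For example, `parse_ratio("48 : 52")` returns `(48, 52)`, and `standardize_country("Untied States", ["United States", "United Kingdom"])` resolves the typo to `"United States"`. Checking the manual overrides before the fuzzy step keeps known abbreviations from being matched to the wrong country.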

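The stratification thresholds and correlation measures described above can be illustrated with a small dependency-free sketch. The project itself used NumPy and SciPy; the function names here, and the "Imbalanced" label for the non-balanced group, are assumptions made for illustration, since the post only names the "Balanced" category.

```python
import math


def stability_tier(score):
    """Bucket a political stability score: Low < 40, Medium 40-69.99, High >= 70."""
    if score < 40:
        return "Low"
    if score < 70:
        return "Medium"
    return "High"


def gender_balance(female_ratio, male_ratio):
    """'Balanced' when the two shares differ by less than 10 points."""
    # "Imbalanced" is a hypothetical label; the post does not name this group.
    return "Balanced" if abs(female_ratio - male_ratio) < 10 else "Imbalanced"


def pearson_r(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    return cov / math.sqrt(var_x * var_y)


def log_scaled_r_squared(gdp_values, scores):
    """R^2 of a simple linear fit of scores against log10(GDP).

    For a one-predictor least-squares line, R^2 equals the squared Pearson r.
    """
    log_gdp = [math.log10(g) for g in gdp_values]  # requires positive GDP values
    return pearson_r(log_gdp, scores) ** 2
```

In a SciPy-based workflow, `scipy.stats.pearsonr` would replace the hand-written `pearson_r`; the manual version is shown only to make the arithmetic explicit.
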
Skills: Python (Programming Language) · NumPy · Seaborn · SciPy · Tableau

Author

shanghaizhangyijie@gmail.com
