Hyperspectral identification of mineral pigments in Thangka paintings for cultural heritage conservation

Spectral characteristics of pure pigments

To construct a reliable endmember library and support subsequent modeling, we first analyzed the spectral characteristics of the 12 pure pigments. After removing noisy bands near 400 nm and 1000 nm, all spectra were standardized to 201 wavelength points. Figure 4 presents the average reflectance curves, revealing clear inter-class differences in the 400–900 nm range. Blue pigments, dominated by Azurite, exhibit a reflectance peak near 460 nm and strong absorption from 600 to 900 nm ^22,23. Green pigments (Malachite) display a distinct absorption band around 800 nm ^23. Yellow pigments (Realgar and Orpiment) show diagnostic absorption features between 470 and 530 nm; Cinnabar exhibits an inflection near 600 nm ^22,49; and Pearl White is characterized by a nearly flat, high-reflectance profile ^23. These observations are consistent with the pigments' mineralogical compositions and provide a solid basis for spectral classification. Principal component analysis (PCA, two components; Fig. 5) further demonstrates clear class separation, with only partial overlap within the blue and green pigment groups.

**Fig. 4: Reflectance spectra of the 12 pure pigments.**

To evaluate the effect of preprocessing on classification, seven preprocessing strategies and six classifiers were systematically tested using both the full spectral range (201 bands) and a reduced feature set of 20 SHAP-selected wavelengths. As summarized in Table 1, the best-performing combinations achieved high accuracy, and all configurations exceeded 90% (complete results in Table S1, Supplementary Information). Derivative-based strategies (e.g., SGFD, SNVFD, MSCFD) consistently improved model stability, and the SGFD–LDA model showed the most robust performance across folds when using the full spectral range. The confusion matrix in Fig. 6 indicates near-perfect separability, suggesting that hyperspectral information alone is sufficient for reliable pigment classification.
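The preprocessing-plus-classifier evaluation described above can be sketched as follows. This is a minimal illustration rather than the authors' code: it assumes SGFD denotes a Savitzky-Golay first derivative, and the window length, polynomial order, and synthetic Gaussian-shaped spectra are placeholders chosen only so the example runs.

```python
import numpy as np
from scipy.signal import savgol_filter
from sklearn.discriminant_analysis import LinearDiscriminantAnalysis
from sklearn.model_selection import StratifiedKFold, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import FunctionTransformer


def sg_first_derivative(X, window_length=11, polyorder=2):
    """Savitzky-Golay first derivative along the wavelength axis.

    window_length and polyorder are assumed values, not taken from the source.
    """
    return savgol_filter(X, window_length=window_length,
                         polyorder=polyorder, deriv=1, axis=1)


def make_synthetic_spectra(n_classes=12, n_per_class=40, n_bands=201, seed=0):
    """Hypothetical stand-in for the pigment library: one Gaussian peak per class."""
    rng = np.random.default_rng(seed)
    wavelengths = np.linspace(400, 1000, n_bands)
    X, y = [], []
    for c in range(n_classes):
        center = 430 + 45 * c
        base = 0.2 + 0.6 * np.exp(-((wavelengths - center) / 60.0) ** 2)
        offset = rng.normal(0, 0.05, size=(n_per_class, 1))   # baseline shift
        noise = rng.normal(0, 0.01, size=(n_per_class, n_bands))
        X.append(base + offset + noise)
        y.extend([c] * n_per_class)
    return np.vstack(X), np.array(y)


X, y = make_synthetic_spectra()

# The derivative step lives inside the pipeline so it is applied per fold.
model = make_pipeline(FunctionTransformer(sg_first_derivative),
                      LinearDiscriminantAnalysis())
cv = StratifiedKFold(n_splits=5, shuffle=True, random_state=0)
scores = cross_val_score(model, X, y, cv=cv)
print("five-fold accuracy: mean=%.3f, std=%.3f" % (scores.mean(), scores.std()))
```

Reporting the per-fold spread alongside the mean is one simple way to compare the stability of different preprocessing choices.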

**Fig. 6: Confusion matrix of the SGFD–LDA model for classifying the 12 pure pigment categories.**

Table 1 Summary of the best-performing classification models for the 12 pure pigments, based on five-fold cross-validation accuracy (k = 5)

To enhance computational efficiency while retaining high accuracy, we applied feature selection to identify 20 informative wavelengths. Using SHAP, the rankings were cross-validated with feature importance (FI) and permutation importance (PI), yielding consistent results across folds. The selected wavelengths concentrate mainly in the 400–600 nm region, corresponding to known pigment absorption features ^22,23, thereby supporting physical interpretability. Representative SHAP plots (Fig. 7) confirm strong class-specific discriminative power. Re-evaluation using only the 20 wavelengths maintained high accuracy, with the MSCFD–KNN model achieving the best overall performance (Table 1). This dimensionality reduction preserves predictive capability while substantially lowering computational cost and model complexity; detailed results for the 20-wavelength setting are provided in Table S1. These compact, feature-based models offer a robust foundation for subsequent tasks, including mixture regression and unknown pigment identification, highlighting the practicality of lightweight spectral approaches for cultural-heritage applications.
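As a rough illustration of ranking wavelengths and keeping the top 20, the sketch below uses impurity-based feature importance from a random forest, which corresponds to the FI cross-check mentioned above rather than to SHAP itself. The classifier choice, the four synthetic classes, and all parameter values are assumptions made for the example.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

rng = np.random.default_rng(0)
wavelengths = np.linspace(400, 1000, 201)

# Hypothetical spectra: each class has one peak between 440 and 560 nm,
# so only the short-wavelength region carries class information.
n_classes, n_per_class = 4, 40
X_parts, y_parts = [], []
for c in range(n_classes):
    center = 440 + 40 * c
    base = 0.3 + 0.5 * np.exp(-((wavelengths - center) / 25.0) ** 2)
    X_parts.append(base + rng.normal(0, 0.02, size=(n_per_class, wavelengths.size)))
    y_parts.append(np.full(n_per_class, c))
X = np.vstack(X_parts)
y = np.concatenate(y_parts)

clf = RandomForestClassifier(n_estimators=200, random_state=0).fit(X, y)

# Rank bands by importance and keep the 20 highest-ranked ones.
top20 = np.argsort(clf.feature_importances_)[::-1][:20]
selected = np.sort(wavelengths[top20])
print("selected wavelengths (nm):", np.round(selected, 1))

# Refit on the reduced feature set.
clf_small = RandomForestClassifier(n_estimators=200, random_state=0)
clf_small.fit(X[:, top20], y)
```

In a real workflow the ranking would be computed on training folds only, so that the selected wavelengths do not leak information from the held-out data.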

**Fig. 7: Global feature importance derived from SHAP analysis of the classification model.**

Spectral characteristics of mixed pigments

Figure 8 shows the average reflectance curves of the 54 mixed-pigment samples. The spectra vary consistently with changing mixing ratios. For combinations such as Blue–Realgar, Blue–Cinnabar, Green–Realgar, and Green–Cinnabar, gradual shifts in reflectance profiles emerge as the dominant endmember proportion increases, reflecting a smooth transition of spectral influence across 400–1000 nm. In contrast, mixtures containing Pearl White primarily exhibit increased overall reflectance while maintaining spectral shape, confirming its role as a brightness-enhancing background pigment rather than a strong spectral modifier. These observations indicate that the behavior of mixed pigments is governed not only by relative proportion but also by intrinsic spectral strength. Deviations from linear blending, particularly in copper- and mercury-based pigments, suggest nonlinear interactions associated with scattering effects, layer thickness, or binder composition; these tendencies align with prior heritage spectroscopy reports ^22,23. Consequently, regression-based models are well-suited to capture subtle compositional dependencies beyond simple linear interpolation.

**Fig. 8: Spectral curves of six binary pigment mixtures across nine mixing ratios (1:9 to 9:1).**

We compared three regression models—PLSR, PCR, and SVR—under seven preprocessing strategies across six pigment pairs, evaluating performance using R², RMSE, MAE, and RPD (Table 2). Overall, PLSR consistently outperformed PCR and SVR, demonstrating superior robustness and predictive stability. The SGFD–PLSR configuration provided the best results, with average R² = 0.98, RMSE = 0.04, and RPD = 7.20 across all groups; predicted ratios closely matched ground truth (Fig. 9), supporting its use as the backbone of the conditional regression module. Negative R² values observed under MSC and MSCFD indicate that these corrections can distort spectral structure, likely due to nonlinear scattering behaviors of mineral pigments that depart from MSC’s linear assumptions, thereby weakening the relationship between reflectance and mixing ratio. These findings underscore the need to pair regression algorithms with preprocessing choices that balance robustness, interpretability, and computational efficiency.

Table 2 Regression performance for six pigment groups using seven preprocessing techniques and three regression algorithms

To reduce redundancy while maintaining accuracy, VIP-based wavelength selection was integrated into the SGFD–PLSR pipeline. For each pigment group, the 20 most influential wavelengths were extracted from training data and used to rebuild the regression models (Fig. 10). As summarized in Table 3, performance remained strong after dimensionality reduction: all R² values exceeded 0.94, with an overall average of 0.975; RMSE and RPD also stayed within acceptable ranges, indicating that key diagnostic information was retained. The Blue–Cinnabar group showed slightly lower accuracy (R² = 0.95), likely due to overlapping signatures and similar inflection points, but errors remained modest. Taken together, mixed-pigment analysis highlights both the challenges of nonlinear spectral interactions and the effectiveness of regression-based modeling. Among the evaluated options, PLSR—especially when combined with SGFD preprocessing and VIP-based feature selection—offers a reliable and computationally efficient solution for mixing-ratio estimation in practical pigment-identification workflows.

Table 3 Prediction performance of the SGFD–PLSR model using VIP-selected wavelengths for six pigment mixture groups

Evaluation of spectral unmixing methods

Assuming prior knowledge of pigment-mixture types, we first constructed local unmixing models using endmember spectra for the six predefined combinations. Three linear algorithms (FCLS, Tikhonov-FCLS, SUnSAL) and three nonlinear algorithms (GBM, PPNMM, FM) were applied to estimate abundance ratios, with performance summarized in Table 4. RMSE was used as the primary evaluation metric. Overall, the local methods yielded preliminary abundance estimates without supervised training and offered strong physical interpretability. However, quantitative accuracy was limited: the best-performing model (Tikhonov-FCLS) reached a mean RMSE of 0.179, substantially higher than that of the SGFD–PLSR regression model in “Spectral characteristics of mixed pigments” (RMSE = 0.037). Nonlinear models (GBM, PPNMM) produced only modest gains over linear methods, consistent with predominantly weakly nonlinear or locally linear mixing. In addition, the lack of explicit spectral-variability modeling made traditional unmixing more sensitive to background interference and noise. Local unmixing remains a valuable, physically interpretable baseline, but its accuracy depends strongly on prior endmember knowledge and is insufficient for precise quantification.
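For readers unfamiliar with constrained linear unmixing, the sketch below shows one common way to approximate FCLS: non-negative least squares with a heavily weighted extra row that pushes the abundances to sum to one. Whether the authors used this particular solver is not stated; the endmember curves, the weight `delta`, and the noise level are hypothetical.

```python
import numpy as np
from scipy.optimize import nnls


def fcls(endmembers, spectrum, delta=1e3):
    """Approximate fully constrained least squares unmixing.

    endmembers: array of shape (n_bands, n_endmembers)
    spectrum:   array of shape (n_bands,)
    Non-negativity comes from NNLS; the sum-to-one constraint is enforced
    softly by an appended row weighted by delta (an assumed value).
    """
    n_end = endmembers.shape[1]
    A = np.vstack([endmembers, delta * np.ones((1, n_end))])
    b = np.append(spectrum, delta)
    abundances, _ = nnls(A, b)
    return abundances


# Hypothetical three-pigment library (columns) on a 201-band grid.
wavelengths = np.linspace(400, 1000, 201)
blue = 0.1 + 0.5 * np.exp(-((wavelengths - 460) / 40.0) ** 2)
green = 0.1 + 0.4 * np.exp(-((wavelengths - 530) / 40.0) ** 2)
red = 0.1 + 0.6 / (1.0 + np.exp(-(wavelengths - 600) / 15.0))
E = np.column_stack([blue, green, red])

# A 7:3 blue:red linear mixture with a little measurement noise.
rng = np.random.default_rng(0)
true_abundances = np.array([0.7, 0.0, 0.3])
mixture = E @ true_abundances + rng.normal(0, 0.005, size=wavelengths.size)

estimated = fcls(E, mixture)
rmse = float(np.sqrt(np.mean((estimated - true_abundances) ** 2)))
print("estimated abundances:", np.round(estimated, 3), "RMSE:", round(rmse, 4))
```

The synthetic mixture here is exactly linear, so recovery is nearly perfect; the larger errors reported above for real pigment mixtures reflect departures from this idealized model.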

Table 4 Results of six unmixing models for six pigment mixtures under local unmixing

To explore identification without prior knowledge, we next evaluated a global unmixing strategy. A reduced endmember library of five representative pigments (Blue 1, Green 1, Realgar, Cinnabar, Pearl White) was assembled, and Tikhonov-FCLS was applied to all 54 mixtures to infer both endmember combinations and abundances (Table 5). The global approach achieved only moderate success: approximately 74% of mixtures were assigned correct endmember sets, with substantial variation across pigment groups. For example, Blue–Realgar and Blue–Cinnabar were often correctly identified, whereas Green–Cinnabar samples were frequently misclassified as Blue–Realgar, reflecting spectral similarities between copper- and mercury-based pigments. Abundance estimates also deviated; for instance, a true 9:1 Green:Pearl White mixture was misidentified as 0.92:0.08 Green:Blue, illustrating how background reflectance and endmember confusion can distort quantitative predictions. These results highlight the limitations of global unmixing in realistic heritage scenarios: spectral similarity, nonlinear interactions, and background interference collectively reduce recognition accuracy and quantitative reliability. While global unmixing shows a degree of autonomous recognition, its precision falls short of practical requirements for Thangka pigment analysis, underscoring the need for data-driven, multi-stage frameworks that integrate classification with conditional regression under unknown conditions.

Table 5 Endmember identification and abundance estimation by the Global Tikhonov-FCLS Model

Application of the multi-stage strategy to a real Thangka

In the preceding sections, classification and regression models were systematically benchmarked on standardized laboratory-prepared samples. Multiple configurations achieved accuracies above 98% for pure-pigment classification, while the SGFD–PLSR model yielded an average R² = 0.975 for mixture-ratio prediction (Table 3), substantially outperforming conventional spectral unmixing methods. These results establish the proposed multi-stage framework as a competitive approach that integrates pure/mixed discrimination, subclass classification, and conditional regression within a unified workflow. To assess applicability under realistic conditions, the framework was validated on a hand-painted Thangka image. For comparison, a widely used prior-free spectral matching method—Spectral Angle Mapper (SAM)—was also evaluated, enabling a fair contrast between supervised and unsupervised strategies in practical cultural-heritage scenarios (workflow in Fig. 3).

The framework was tested on a real Thangka (Baoluo Foshou) using eight annotated regions of interest (ROIs; Fig. 11) covering both pure pigments and mixtures. The classification outcomes are summarized in Table 6, with an overall accuracy of 75%. Most ROIs were correctly identified, indicating that the framework can generalize beyond laboratory samples. Misclassifications occurred in two ROIs and are discussed below. Given the small number of ROIs, this validation should be regarded as a proof of concept rather than comprehensive evidence of generalizability.

**Fig. 11: Validation on a real Thangka painting (Baoluo Foshou).**

Table 6 Validation results of the multi-stage strategy on a real Thangka (Baoluo Foshou)

Pure pigments were reliably identified, whereas mixtures showed greater variability. For mixtures, the average RMSE of ratio estimation was 0.17, with one ROI exhibiting an error below 0.05. ROI 5 (Blue 1:Realgar) displayed a higher RMSE of 0.31 despite correct class assignment, likely due to uneven pigment distribution and strong scattering contrasts between blue and yellow components, which introduced local spectral nonlinearity and affected quantitative accuracy. The two misclassifications were mainly attributable to spectral similarity between closely related pigments (ROI 1: Blue 1 predicted as Blue 2; see Fig. 4) and insufficient pigment thickness that allowed canvas background interference (ROI 8: Green 1 predicted as Green 1:Pearl White).

For the SAM baseline, the overall classification accuracy also reached 75% (Table 7), correctly identifying several mixtures but without the ability to provide continuous ratio estimates. Misclassifications again appeared in ROI 1 (Blue 1 predicted as Blue 2), reflecting strong spectral similarity among azurite-based pigments, and ROI 8 (Green 1 predicted as Green 3), highlighting confusion among closely related greens when intra-class variability is not well represented in the reference library.
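The SAM baseline assigns each spectrum to the reference with the smallest spectral angle. A minimal sketch follows; the library labels are borrowed from the text, but the reference curves and the query spectrum are synthetic placeholders.

```python
import numpy as np


def spectral_angle(a, b):
    """Angle in radians between two spectra treated as vectors."""
    cos = np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b))
    return float(np.arccos(np.clip(cos, -1.0, 1.0)))


def sam_classify(spectrum, library):
    """Return the best-matching label and the angle to every reference."""
    angles = {name: spectral_angle(spectrum, ref) for name, ref in library.items()}
    return min(angles, key=angles.get), angles


# Hypothetical reference library on a 201-band grid.
wavelengths = np.linspace(400, 1000, 201)
library = {
    "Blue 1": 0.1 + 0.5 * np.exp(-((wavelengths - 460) / 40.0) ** 2),
    "Green 1": 0.1 + 0.4 * np.exp(-((wavelengths - 530) / 40.0) ** 2),
    "Cinnabar": 0.1 + 0.6 / (1.0 + np.exp(-(wavelengths - 600) / 15.0)),
}

# A darker, slightly noisy observation of the Cinnabar-like reference.
rng = np.random.default_rng(0)
query = 0.6 * library["Cinnabar"] + rng.normal(0, 0.005, size=wavelengths.size)

label, angles = sam_classify(query, library)
print(label, {k: round(v, 3) for k, v in angles.items()})
```

Because the angle ignores overall scaling, SAM tolerates brightness differences, but it returns a single best match and therefore cannot report mixing ratios.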

Table 7 Validation results of the SAM method on a real Thangka (Baoluo Foshou)

Both approaches achieved comparable recognition rates on this real Thangka. However, the multi-stage framework additionally provides continuous abundance estimates, shows greater robustness to spectral noise, and scales more readily to larger pigment libraries. A detailed comparative discussion and implications for conservation practice are presented in “Discussion”.

Summary of results

Overall, the results demonstrate that pure pigments exhibit highly distinctive spectral signatures, enabling near-perfect classification under controlled conditions. Mixture analyses further confirmed the effectiveness of regression-based modeling, with the SGFD–PLSR configuration consistently outperforming conventional unmixing approaches. Feature selection markedly reduced data dimensionality while maintaining predictive accuracy, supporting the development of lightweight yet interpretable models for practical use. Validation on a real Thangka painting achieved an overall accuracy of 75%, underscoring both the feasibility of the proposed framework and the challenges inherent to analyzing complex, aged artworks.

Collectively, these findings validate the potential of the proposed multi-stage recognition framework for practical pigment identification in cultural heritage applications, while also revealing critical limitations that merit further investigation. The following Discussion section elaborates on the implications of these results, addresses the observed limitations, and outlines directions for future research.
