An exploration of cervical cancer risk data using Principal Component Analysis (PCA) and clustering algorithms