Skip to content

This repository contains the scripts that were used for my master thesis project, titled 'Implementing data science and machine learning algorithms to extract morphological information from histopathology slides of DLBCL tissue'

Notifications You must be signed in to change notification settings

DarwinsFinch/Master-thesis-Lunas-Hahn-

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

18 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Master-thesis-Lunas-Hahn-

This repository contains the scripts that were used for my master thesis project, titled 'Implementing data science and machine learning algorithms to extract morphological information from histopathology slides of DLBCL tissue'

The repository includes the follwoing scripts;

Step 1:

MA_Lunas_Hahn_create_df_with_interpolated_data_points_for_all_cells.py MA_Lunas_Hahn_create_df_with_interpolated_data_points_for_labeled_cells.py

This is used to interpolate the contour points from each cell, so that there are 50 points for each single cell. Further, the script creates a csv file that includes the cells name, label, number, and contour points.

Step 2:

MA_Lunas_Hahn_get_single_cells_aligned_and_masked_for_labeled_and_unlabeled.py

This is used to get the single cell images out of the whole slide image

Step 3:

MA_Lunas_Hahn_construct_CV_and_PCA_features_for_all_cells.py MA_Lunas_Hahn_construct_CV_and_PCA_features_for_labeled_cells.py

This is used to calculate the different CV features and do the PCA on the contour points.

Step 4:

MA_Lunas_Hahn_CV_and_PCA_features_analysis.py

This is used to scale and analyse the CV and PCA feautures; With this I determine the pearson correlation, use PCA, t-SNE and UMAP and train the RF and Lasso regression model to evaluate the meaningfulness of the features. It also combines both feature vector and check the meaningfulness for this vector with the same approach as above.

Step 5:

MA_Lunas_Hahn_train_AE_and_analyse_the_AE_features_and_do_AE_plus_CV_feature_analysis.py

This builds and trains the AE on the labeled single cell images. It also uses the latent as a feature vector and does PCA, t-SNE and UMAP as well as RF and Lasso with this vector. It also combines the AE and CV features and uses this vector for the same ML approaches.

Step 6:

MA_Lunas_Hahn_train_VAE_and_evaluate_it.py

This builds and trains the VAE using a cyclic weighting of the KLD-loss.

About

This repository contains the scripts that were used for my master thesis project, titled 'Implementing data science and machine learning algorithms to extract morphological information from histopathology slides of DLBCL tissue'

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages