Troubleshooting: Parameter Search for embedding dimension#
If you were suspecious about the oversimplified look of the function “RP_computer”, you are correct. This function also implements a parameter search method for finding the embedding dimensions under the hood. Even though some default values are there, they may not be applicable to all datasets. In such cases, we need to see, what can be set. We had to do this while analyzing data from Koul et al(2023), and it is for this, we have some functions in “RP_maker_diagnose.py”. We will see how we can use these functions
from RP_maker_diagnose import fnnhitszero_Plot
from RP_maker_diagnose import findm_Plot
from RP_maker_diagnose import RP_diagnose
from RP_maker_diagnose import get_minFNN_distribution_plot
input_path = '/user/swarag/Koul et al/data_npy' # directory to which the signals are saved
diagnose_dir = '/user/swarag/Koul et al/diagnose' # directory in which pickle files from this function are saved
RP_diagnose(input_path, diagnose_dir)
Now, we have saved the picke files to a directory, we can use those pickle files in the next function. Following function estimates the lower and upper bound (2.5th and 975th percentile) of the distribution of minimum false nearest beighbour(FNN) values for different embedding dimensions. It gives a plot and a CSV file.
get_minFNN_distribution_plot(path, 'Koul_et_al_RP_diagnose')