Wrap_up_plot fails when it has to handle unsuccessful runs #5

rcap107 · 2024-01-04T18:10:25Z

wrap_up_plot fails to vstack elements because some columns have dtype utf-8 instead of f64: these columns are created when the corresponding run fails, so it's caused by the poor handling of failed runs.

    161 def wrap_up_plot(exp_name, task="regression", variable_of_interest=None):                                                                             
    162     """Prepare and save the plots relevant to the task under consideration.                                                                           
    163     If the task is `regression`, plot `r2score`, if the task is `classification`,                                                                     
    164     plot `f1score`.                                                                                                                                   
   (...)                                                                                                                                                      
    169         `classification`. Defaults to "regression".                                                                                                   
    170     """                                                                                                                                               
--> 171     df_raw = read_logs(exp_name=exp_name)                                                                                                             
    173     if task == "regression":                                                                                                                          
    174         current_score = "r2score"                                                                                                                     
                                                                                                                                                              
File ~/work/benchmark-join-suggestions/src/utils/logging.py:75, in read_logs(exp_name, exp_path)                                                              
     73 for f in path_agg_logs.glob("*.log"):                                                                                                                 
     74     logs.append(pl.read_csv(f))                                                                                                                       
---> 75 df_agg = pl.concat(logs)                                                                                                                              
     77 return df_agg                                                                                                                                         
                                                                                                                                                              
File ~/mambaforge/envs/bench-repro/lib/python3.10/site-packages/polars/functions/eager.py:170, in concat(items, how, rechunk, parallel)                       
    168 if isinstance(first, pl.DataFrame):                                                                                                                   
    169     if how == "vertical":                                                                                                                             
--> 170         out = wrap_df(plr.concat_df(elems))                                                                                                           
    171     elif how == "vertical_relaxed":                                                                                                                   
    172         out = wrap_ldf(                                                                                                                               
    173             plr.concat_lf(                                                                                                                            
    174                 [df.lazy() for df in elems],                                                                                                          
   (...)                                                                                                                                                      
    178             )                                                                                                                                         
    179         ).collect(no_optimization=True)                                                                                                               
                                                                                                                                                              
ShapeError: unable to vstack, dtypes for column "time_join_train" don't match: `f64` and `str`

The text was updated successfully, but these errors were encountered:

rcap107 · 2024-01-04T18:10:41Z

Related to: #4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Wrap_up_plot fails when it has to handle unsuccessful runs #5

Wrap_up_plot fails when it has to handle unsuccessful runs #5

rcap107 commented Jan 4, 2024

rcap107 commented Jan 4, 2024

Wrap_up_plot fails when it has to handle unsuccessful runs #5

Wrap_up_plot fails when it has to handle unsuccessful runs #5

Comments

rcap107 commented Jan 4, 2024

rcap107 commented Jan 4, 2024