
Fine-tuning EfficientNetB0 with pretrained 'imagenet' weights is not reproducible (i.e., saving and loading the model) #797

Closed
Qasim-Latrobe opened this issue Aug 29, 2024 · 6 comments

Qasim-Latrobe commented Aug 29, 2024

-- tensorflow 2.16.1
-- Similar behavior is observed in tensorflow 2.17.0

I am encountering an issue with fine-tuning an EfficientNetB0 model that was originally pretrained on ImageNet.

Model Training and Fine-Tuning: I start with an EfficientNetB0 model pretrained on ImageNet and fine-tune it on my specific dataset.

Saving the Model: After fine-tuning, I save the model using model.save() with the .keras format.

Loading the Model: When I later load the model using load_model(), the performance of the model does not match the performance achieved during the fine-tuning phase. The results appear to be inconsistent or random.

I am initializing random states through seed for reproducibility.

import numpy as np
import tensorflow as tf
from tensorflow.keras.applications import EfficientNetB0
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Dense, GlobalAveragePooling2D

# Set seeds for reproducibility
tf.random.set_seed(42)
np.random.seed(42)

# Define and compile the model
base_model = EfficientNetB0(weights='imagenet', include_top=False, input_shape=(224, 224, 3))
x = base_model.output
x = GlobalAveragePooling2D()(x)
x = Dense(1024, activation='relu')(x)
predictions = Dense(10, activation='softmax')(x)  # adjust the number of classes
model = Model(inputs=base_model.input, outputs=predictions)

model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

# Train and fine-tune the model (x_train, y_train: my dataset)
model.fit(x_train, y_train, epochs=10)

# Save the fine-tuned model
model.save('fine_tuned_model.keras')

# Load the model later (possibly in a new session)
loaded_model = tf.keras.models.load_model('fine_tuned_model.keras')

# Evaluate performance
results = loaded_model.evaluate(x_test, y_test)

I have observed that when saving the model weights in HDF5 format (.h5) and subsequently loading them within the same session, the validation performance is consistently reproduced. However, when the .h5 weights are loaded in a different session, the validation performance does not match the original results and the accuracy is essentially random.

Additionally, when EfficientNetB0 is instantiated with weights=None (no pretrained weights), the model's performance remains consistent across sessions and is fully reproducible.

Other models such as ResNet50 and VGG16 run as expected; only the EfficientNetBx models show this issue.
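
For reference, this is roughly how I check reproducibility across sessions (a minimal sketch; the weight file name and x_test/y_test are placeholders for my own data):

# New session: rebuild the exact same architecture, then load the previously saved weights
from tensorflow.keras.applications import EfficientNetB0
from tensorflow.keras.models import Model
from tensorflow.keras.layers import Dense, GlobalAveragePooling2D

base_model = EfficientNetB0(weights=None, include_top=False, input_shape=(224, 224, 3))
x = GlobalAveragePooling2D()(base_model.output)
x = Dense(1024, activation='relu')(x)
predictions = Dense(10, activation='softmax')(x)
model = Model(inputs=base_model.input, outputs=predictions)

# Weights were written in the first session with model.save_weights('fine_tuned_model.h5')
model.load_weights('fine_tuned_model.h5')
model.compile(optimizer='adam', loss='categorical_crossentropy', metrics=['accuracy'])

# Within the same session this matches the post-training accuracy; in a fresh session it does not
print(model.evaluate(x_test, y_test))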

@Qasim-Latrobe (Author)

I have managed to resolve part of the issue. The problem stemmed from my use of the Keras ModelCheckpoint callback (https://keras.io/api/callbacks/model_checkpoint/) during training, which was inadvertently overwriting the saved file. After removing this callback, the model behaves as expected. However, I remain puzzled as to why the callback saves an incorrect model state. I am still looking for a proper solution, because my objective is to save the weights of the best-performing model, which is exactly what ModelCheckpoint is meant for.
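
For context, this is roughly how the callback was set up during training (a minimal sketch; the monitored metric, validation split, and file path are illustrative):

from tensorflow.keras.callbacks import ModelCheckpoint

# Keep only the best model seen so far, judged by validation accuracy.
# The callback writes checkpoints during training; in my case this ended up
# overwriting the file I intended to keep.
checkpoint = ModelCheckpoint(
    filepath='fine_tuned_model.keras',
    monitor='val_accuracy',
    save_best_only=True,
    save_weights_only=False,
)

model.fit(x_train, y_train, validation_data=(x_val, y_val),
          epochs=10, callbacks=[checkpoint])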

ghsanti commented Aug 29, 2024

Hi @Qasim-Latrobe,

I got stuck today while trying to create a full repro and found another bug that is now fixed.

  • Does the problem still persist?

  • Could you try to create a Colab where you fine-tune with CIFAR-10 or CIFAR-100 so we can test?

If you do, please install Keras from the master branch.

Otherwise, just use keras-nightly and any other dataset.

If you can't I'll try tomorrow.

Qasim-Latrobe commented Aug 30, 2024

Hi @ghsanti,

Thanks for the prompt response. I believe there is a possible bug in the Keras ModelCheckpoint callback that is triggered only with EfficientNetBx models. I am able to work around it by writing a manual checkpoint callback; a sketch is below.
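
Roughly, my workaround looks like this (the class name, monitored metric, and file path are just illustrative); it keeps the best weights in memory and writes the model once at the end of training:

import tensorflow as tf

class ManualCheckpoint(tf.keras.callbacks.Callback):
    """Track the best validation metric and save the model once, after training."""

    def __init__(self, filepath, monitor='val_accuracy'):
        super().__init__()
        self.filepath = filepath
        self.monitor = monitor
        self.best = -float('inf')
        self.best_weights = None

    def on_epoch_end(self, epoch, logs=None):
        current = (logs or {}).get(self.monitor)
        if current is not None and current > self.best:
            self.best = current
            self.best_weights = self.model.get_weights()

    def on_train_end(self, logs=None):
        if self.best_weights is not None:
            self.model.set_weights(self.best_weights)
            self.model.save(self.filepath)

# usage (validation split is a placeholder):
# model.fit(x_train, y_train, validation_data=(x_val, y_val),
#           epochs=10, callbacks=[ManualCheckpoint('fine_tuned_model.keras')])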

ghsanti commented Aug 30, 2024

@Qasim-Latrobe

This Gist uses CIFAR10.

No difference shows up, though (the last line in the screenshot is the evaluation step):

[screenshot: training log followed by the evaluation step, 2024-08-30]

@Qasim-Latrobe (Author)

@ghsanti

Thank you for your efforts in understanding and reproducing the issue. I will explore the possibility of resolving the problem using tf-nightly, as you suggested. At this point, I have no further comments, as I have implemented a workaround. Thank you once again for your assistance.

ghsanti commented Aug 31, 2024

Nightly was only used to pick up a fix for the CIFAR-10 dataset (and to test whether it works); it's not mandatory.

Using stable versions such as tensorflow~=2.17.0 and keras~=3.5.0 should do (just pick another dataset, or use a custom one).

('nightly' is not a fixed version, so you could see breaking changes further down your project.)
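
If in doubt about which versions actually ended up installed, a quick check like this is enough:

import keras
import tensorflow as tf

print(tf.__version__)     # expected: something like 2.17.x
print(keras.__version__)  # expected: something like 3.5.x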
