Lifted structure loss #340
Conversation
* Add initial version of pn_loss
* Add basic tests for pn_loss

* Update squared L2.
* Version bump to 13.3
* Remove \r chars from notebooks. Was breaking the jupyterlabs editor.

Reviewed live

* Samplers refactoring
* efficientnet better args

* Update squared L2.
* Version bump to 13.3
* Remove \r chars from notebooks. Was breaking the jupyterlabs editor.
* Add lifted struct and multisim loss.
* Clean up doc strings in loss functions.
* Add more specific FloatTensor and IntTensor types to samplers.
* Remove old losses.py module.
* Update multisim loss to include the similarity-P filtering and work with distance instead of similarity.
* Change InnerProductDistance to InnerProductSimilarity to make it clear that this does not function like the other distances.
* Update Circle Loss to work with distance instead of similarity.
* Create custom LogSumExp to support masking and adding 1 to ensure positive loss
* Fix bug in valid_anchors where transpose won't work because the shape is [m]. Revert back to reshape.
* Fix mypy errors
* Remove lifted_struct_loss from losses init
* Add period in comment
Fix tensorflow to 2.5 to avoid breaking tests in 2.6
Thanks for the PR! Looking forward to adding this to the tfsim losses. I left some comments on the loss function.
@owenvallis implemented the changes requested, although I'm a little lost about the following snippet:

```python
# Get negative distances
negative_dists, _ = negative_distances(
    negative_mining_strategy, pairwise_distances, negative_mask
)
```

since neither of the results is used anywhere else.
Good point! Looks like we're missing the column-wise concat in the logsumexp. You had it in the previous commit, so pulling that back in, I think it should be:
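For reference, here is a rough standalone sketch of what a masked logsumexp over a column-wise concat of positive and negative distance blocks could look like. This is an illustration of the idea described in the commit messages ("custom LogSumExp to support masking and adding 1 to ensure positive loss"), not the actual tfsim implementation; `masked_logsumexp` and the toy tensors are made up for this example.

```python
import tensorflow as tf

def masked_logsumexp(data, mask):
    """Stable logsumexp over axis=1 that ignores masked-out entries and
    adds 1 inside the log so the result stays positive (as the commit
    messages in this PR describe)."""
    masked = tf.where(mask, data, tf.fill(tf.shape(data), -1e9))
    row_max = tf.reduce_max(masked, axis=1, keepdims=True)
    summed = tf.reduce_sum(
        tf.cast(mask, data.dtype) * tf.exp(masked - row_max),
        axis=1, keepdims=True,
    )
    # log(1 + sum_i exp(x_i)), computed around row_max for stability.
    return tf.squeeze(row_max + tf.math.log(tf.exp(-row_max) + summed), axis=1)

# Toy positive / negative distance blocks for two anchor rows.
pos_dists = tf.constant([[0.5], [0.7]])
neg_dists = tf.constant([[1.2, 2.0], [0.9, 1.5]])
pos_mask = tf.constant([[True], [True]])
neg_mask = tf.constant([[True, False], [True, True]])

# The column-wise concat of both blocks feeds a single logsumexp per row.
lse = masked_logsumexp(
    tf.concat([pos_dists, neg_dists], axis=1),
    tf.concat([pos_mask, neg_mask], axis=1),
)
```

The mask keeps padded or invalid pairs out of the sum, so rows with different numbers of valid negatives can share one batched op.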
@owenvallis implemented the changes discussed, and also fixed for
Thanks! Looks like the CLA is working for the current commits, but you will need to update the author on the older commit. Let me know once the tests are ready and I'll run the checks and merge the PR.
```python
    positive_mining_strategy, pairwise_distances, positive_mask
)

# Get negative distances
```
Looking at this again, I think we can remove lines 52-54 and use `pairwise_distances` in the `tf.gather` on line 57.

My concern is that the negative mining strategy is meant to return a single negative per row, but here we really want to keep all the negatives.

wdyt?
Yeah, it also makes more sense to do this with `tf.gather` so we keep consistency, something like `reordered_pairwise_distances = tf.gather(pairwise_distances, positive_indices, axis=1)`. As far as I understand from reading the paper, I do think we want to keep all the negatives.

Do you think it makes sense to remove the negative mining strategy altogether?
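As a minimal standalone sketch of that reordering (the matrix values and `positive_indices` here are made up for illustration; they are not the actual tfsim code):

```python
import tensorflow as tf

# Toy 3x3 pairwise distance matrix.
pairwise_distances = tf.constant(
    [[0.0, 1.0, 2.0],
     [1.0, 0.0, 3.0],
     [2.0, 3.0, 0.0]]
)

# Hypothetical column ordering, e.g. produced by a positive-mining step.
positive_indices = tf.constant([2, 0, 1])

# axis=1 reorders the columns while keeping every entry of every row,
# so all negatives survive -- unlike a mining strategy that reduces
# each row to a single negative.
reordered_pairwise_distances = tf.gather(
    pairwise_distances, positive_indices, axis=1
)
```

Because `tf.gather` only permutes, no distance information is dropped, which matches the goal of keeping all negatives per row.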
Also, I keep getting this error when writing tests:

```
FAILED tests/losses/test_lifted_structure_loss.py::TestLiftedStructLoss::test_lifted_struct_loss_test_mode_eager - TypeError: lifted_struct_loss() got multiple values for argument 'distance'
```

for a test like this (just an example one, not the correct one):

```python
import tensorflow as tf
from absl.testing import parameterized
from tensorflow.python.framework import combinations
from tensorflow.keras.losses import Reduction

from tensorflow_similarity import losses

from . import utils


@combinations.generate(combinations.combine(mode=["graph", "eager"]))
class TestLiftedStructLoss(tf.test.TestCase, parameterized.TestCase):
    def test_config(self):
        lifted_obj = losses.LiftedStructLoss(
            reduction=Reduction.SUM,
            name="lifted_loss",
            distance="cosine",
        )
        self.assertEqual(lifted_obj.distance.name, "cosine")
        self.assertEqual(lifted_obj.name, "lifted_loss")
        self.assertEqual(lifted_obj.reduction, Reduction.SUM)

    def test_lifted_struct_loss(self):
        """Tests the LiftedStructLoss with different parameters."""
        labels = tf.constant([1, 2, 2, 1], dtype=tf.int32)
        embeddings = tf.random.normal([4, 10])
        expected_loss = 1
        lifted_obj = losses.LiftedStructLoss(reduction=Reduction.SUM)
        loss = lifted_obj(labels, embeddings)
        self.assertAlmostEqual(loss, expected_loss)
```
Interesting, I'll have to look into that more. Maybe the kwargs also contain the distance and it's getting passed twice in the init call, or something? I'll try to take a look at it tomorrow if I get a chance.
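A minimal reproduction of that failure mode, assuming the wrapper forwards `**kwargs` into a call that also passes `distance` positionally. The function names here are illustrative stand-ins, not the actual tfsim code:

```python
def lifted_struct_loss(labels, embeddings, distance="cosine"):
    """Stand-in for the functional loss; only the signature matters here."""
    return distance


def call_wrapper(labels, embeddings, **kwargs):
    distance = kwargs.get("distance", "cosine")
    # Bug: `distance` is forwarded positionally while still sitting inside
    # **kwargs, so Python sees the argument twice.
    return lifted_struct_loss(labels, embeddings, distance, **kwargs)


err_msg = None
try:
    call_wrapper([1, 2], [[0.1], [0.2]], distance="cosine")
except TypeError as e:
    # "lifted_struct_loss() got multiple values for argument 'distance'"
    err_msg = str(e)


def call_wrapper_fixed(labels, embeddings, **kwargs):
    # Fix: pop the key out of kwargs before forwarding it positionally.
    distance = kwargs.pop("distance", "cosine")
    return lifted_struct_loss(labels, embeddings, distance, **kwargs)
```

If the loss `__init__` or `call` does something like this with its stored `distance`, popping (or never re-forwarding) the key would explain and fix the duplicate-argument error.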
Haven't thought about that, will try some things too.
Made some progress today. We have some problems in the implementation, and the distance handling was missing arguments that were coming from ops.

Closed by accident, sorry, didn't mean to lol
@owenvallis just implemented some sample tests; they're running fine now and the errors are fixed, although the loss values are really weird, so I'm not sure they're correct. I also haven't finished implementing the test cases.
Force-pushed the branch from 412e8d4 to 004c7f5.
New PR: #342
This PR refers to Issue #102 and implements the Lifted Structure Loss (LSL) described in https://arxiv.org/pdf/1511.06452.pdf.
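For context, the lifted structured loss from that paper (Song et al., CVPR 2016) has roughly the following form; this is my transcription, so it is worth double-checking against the article. Here `D_{i,j}` is the pairwise distance, `α` the margin, and `P`/`N` the sets of positive/negative pairs:

```latex
\tilde{J}_{i,j} = D_{i,j}
  + \log\!\Big(\sum_{(i,k)\in\mathcal{N}} e^{\alpha - D_{i,k}}
  + \sum_{(j,l)\in\mathcal{N}} e^{\alpha - D_{j,l}}\Big),
\qquad
J = \frac{1}{2|\mathcal{P}|}
    \sum_{(i,j)\in\mathcal{P}} \max\big(0, \tilde{J}_{i,j}\big)^{2}
```

The inner logsumexp runs over *all* negatives of both anchors of a positive pair, which is why the review discussion above favors keeping every negative rather than mining a single one per row.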