You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I'm not too sure if this issue sits within dbplyr or RSQLite, but the slice_sample() function does not appear to be taking account of a seed set via set.seed() when using an SQLite database.
If I set a seed, then run slice_sample(), then later set the same seed, and run slice_sample() again, the sampled table is different.
For comparison, when undertaking these sample operations using a Microsoft SQL Server database, the expected behaviour is observed (i.e., the two sampled tables are identical).
We'd need a deterministic version of RANDOM(), which doesn't seem to exist at the time of writing. We'll need either an SQLite extension or a patch in SQLite itself.
I'm not too sure if this issue sits within dbplyr or RSQLite, but the slice_sample() function does not appear to be taking account of a seed set via set.seed() when using an SQLite database.
If I set a seed, then run slice_sample(), then later set the same seed, and run slice_sample() again, the sampled table is different.
For comparison, when undertaking these sample operations using a Microsoft SQL Server database, the expected behaviour is observed (i.e., the two sampled tables are identical).
This has been raised on
dbplyr
as tidyverse/dbplyr#1344The text was updated successfully, but these errors were encountered: