WIP: nomenclature #5

joehand · 2018-02-06T00:24:11Z

This DEP is still work-in-progress.

I've added the summary/motivation and Bryan's questions. I need to put a bit of thought in how to best organize this, in the case we have a lot of terms in here.

We also have existing terms in the Dat documentation: https://docs.datproject.org/terms. I can update/consolidate those to this DEP.

I will try to collect some good examples of other nomeclature/naming convention docs as motivation, if you have suggestions.

pfrazee · 2018-02-06T01:06:10Z

Good call. I share the question about registers.

joehand · 2018-02-06T02:12:07Z

On that topic, I started to try to add some decision making criteria when deciding between a few possible words (realizing now I should make a section for this):

By defining Dat nomenclature, we can ensure the writing of the wider Dat community also uses the preferred terms. ... To reduce barriers to entry, this DEP will prefer words that are less technical while conveying the same meaning.

So, though register is more technically accurate it seems like feed or log may be preferred.

joehand · 2018-03-21T16:16:48Z

Note to self from previous meeting:

discuss syncing/seeding/etc in nomenclature DEP - jhand will formalize terms that need definition

martinheidegger · 2018-11-15T10:03:17Z

Things I miss specified:

Version: What is considered a version in DAT?
Bootstrapping: What do we mean when bootstrapping the network?
Sparse: DAT's can be "sparsely" replicated?!
Checkout: What is considered a checkout?
Live: There are properties in code that refer to something being "live"

... and a link to the terminology used in the dat protocol book: https://github.com/datprotocol/book/blob/a3ca149853b9153c7140876d6f749ecad5c6edbb/src/ch03-01-terminology.md

pfrazee · 2018-11-15T17:51:05Z

I'll offer some definitions here...

Version: Internally every dat data-structure is composed of append-only logs (hypercores). Any time an entry is appended to the log, a new version is created. The version is identified according to the semantics of the data-structure. In the case of single-writer hyperdrive, it's currently being identified by the metadata log's latest message number.
Bootstrapping: This is probably referring to getting connected to the discovery DHT network.
Sparse: Means that the data-set is only partially downloaded/replicated.
Checkout: Viewing a previous version of a dat.
Live: This one is a little vague but usually means "connected to peers and downloading updates as they come."

bnewbold · 2018-11-17T21:14:18Z

I would define a Checkout as a folder containing files from a dat/hyperdrive feed at a specific version (which could be the most recent version; doesn't need to be "previous"). This is distinct from having the same content stored locally in SLEEP files. The terminology comes from git and git checkout.

To clarify Version, it's the integer message number of a hypercore feed. These days, with multi-writer, the term gets a bit more ambiguous because there are multiple feeds, so the version of a hyperdb overall can be an array of (feed, integer) pairs. UX/nomenclature around this will probably need an update for dat-on-multiwriter-hyperdb.

Sparse usually implies that not only is the dataset/feed only partially replicated, but that it's intentionally only partially replicated: the user only wanted, eg, a sub-directory, or only specific versions replicated. I don't think there is clarity/terminology around the case of having "the entire most recent version of a hyperdrive/hyperdb" (eg, full values for all keys/files at the most recent version) but not full history: is that considered Sparse? In conversation i've usually heard people refer to this as the default condition (just having the most recent version), and having the full history of the feed being a speciall "Full History" or "Archival" copy.

I agree with pfrazee on Bootstrapping and Live.

aral · 2018-12-14T22:18:12Z

Suggestion regarding key naming, to strengthen the intent and usage of the keys and remove ambiguity about what abilities various keys grant:

Public key → Read key
Secret key → Write key
Discovery key → Discovery key (unchanged)

This way, people new to the system will not be misled into thinking, for example, that the public key is public (where would they get such an idea?) ;P

And the keys do exactly what they say on the tin.

Example usage:

The Read Key grants read access to a DAT whereas the Write Key is required to write to a DAT. The Discovery Key is used to discover a DAT and it is derived as a hash of the Read Key. The Read Key and Write Key should both be kept secret.

Thoughts?

yoshuawuyts · 2018-12-18T17:10:10Z

@aral I think that's a pretty reasonable suggestion that could remove some ambiguity. I also always mistake "secret key" with "private key", which this would also help solve.

martinheidegger · 2019-01-18T02:14:52Z

@aral I am considering working on an "encrypted DAT". :DATs that are additionally encrypted with yet another key in order to implement proxies/bridges that don't know about the content of a DAT. Do you have any idea how this Key should be called? ;)

aral · 2019-01-18T07:58:37Z

If you mean encrypting the contents of hypercores, I’d say “encryption key” does what it says on the tin.

martinheidegger · 2019-01-18T08:44:06Z

Thanks, naming is hard :-)

martinheidegger · 2019-02-28T00:28:02Z

What entails "sync" in dat sync and "share" in dat share? (Question that came up in chat) Also: how to call a peer that has a write key vs a peer that doesn't?

aral · 2019-02-28T07:02:31Z

Here’s the latest glossary for Hypha, in case it helps – please feel free to use the definitions that apply: https://ar.al/2019/02/18/hypha-glossary/

Regarding the last question in your latest comment, @martinheidegger, authorised vs unauthorised is what I’m using.

martinheidegger · 2019-02-28T14:08:54Z

"authorised" makes sense in a multiwriter context, but in a single-writer context (that will exist in future) it feels weird as there is no way how to every authorise another peer.

RangerMauve · 2019-03-19T02:49:54Z

Pinning Service: A server you can send your dat:// URL to in order for it to replicate your content and stay online. They're useful for making sure content is available in the network. Example: Hasbhase

martinheidegger · 2019-03-19T03:23:17Z

@RangerMauve "pinning service" was also known as "publishing": https://github.com/datproject/dat/blob/master/src/commands/publish.js

registry: A server that can replicate a DAT - usually with a login
publish: Telling a registry to replicate a DAT

martinheidegger · 2019-03-19T03:35:49Z

@RangerMauve Do you think registry/publish is worse than "pinning service"/pinning? (is it the same thing?)

RangerMauve · 2019-03-19T03:39:04Z

I haven't seen registry/publish used outside of datbase, and I haven't seen datbased used much.

I like pinning because it's more descriptive of what's actually happening. A registry/publishing implies some sort of centralization or control. Whereas pinning has more of a "Hey, I'm keeping this around for you" feeling where you're still in control and it's no big deal who's pinning it.

I also like the term "Seeding" since it relates to the BitTorrent world

joehand · 2019-03-19T18:17:53Z

Some of the pinning stuff may be addressed in https://www.datprotocol.com/deps/0003-http-pinning-service-api/

wip draft of nomenclature with summary + motivation

03b8d10

joehand mentioned this pull request Oct 26, 2018

Normalize cross-project terminology dat-ecosystem-archive/book#3

Open

martinheidegger mentioned this pull request Nov 17, 2018

History view dat-ecosystem-archive/dat-desktop#349

Open

martinheidegger mentioned this pull request Mar 19, 2019

20190314 - meeting dat-ecosystem/comm-comm#38

Closed

RangerMauve mentioned this pull request Apr 30, 2019

Added more terms dat-ecosystem-archive/docs#154

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

WIP: nomenclature #5

WIP: nomenclature #5

joehand commented Feb 6, 2018 •

edited

Loading

pfrazee commented Feb 6, 2018

joehand commented Feb 6, 2018 •

edited

Loading

joehand commented Mar 21, 2018

martinheidegger commented Nov 15, 2018

pfrazee commented Nov 15, 2018

bnewbold commented Nov 17, 2018

aral commented Dec 14, 2018 •

edited

Loading

yoshuawuyts commented Dec 18, 2018 •

edited

Loading

martinheidegger commented Jan 18, 2019

aral commented Jan 18, 2019

martinheidegger commented Jan 18, 2019

martinheidegger commented Feb 28, 2019

aral commented Feb 28, 2019

martinheidegger commented Feb 28, 2019

RangerMauve commented Mar 19, 2019

martinheidegger commented Mar 19, 2019 •

edited

Loading

martinheidegger commented Mar 19, 2019 •

edited

Loading

RangerMauve commented Mar 19, 2019

joehand commented Mar 19, 2019

WIP: nomenclature #5

Are you sure you want to change the base?

WIP: nomenclature #5

Conversation

joehand commented Feb 6, 2018 • edited Loading

pfrazee commented Feb 6, 2018

joehand commented Feb 6, 2018 • edited Loading

joehand commented Mar 21, 2018

martinheidegger commented Nov 15, 2018

pfrazee commented Nov 15, 2018

bnewbold commented Nov 17, 2018

aral commented Dec 14, 2018 • edited Loading

yoshuawuyts commented Dec 18, 2018 • edited Loading

martinheidegger commented Jan 18, 2019

aral commented Jan 18, 2019

martinheidegger commented Jan 18, 2019

martinheidegger commented Feb 28, 2019

aral commented Feb 28, 2019

martinheidegger commented Feb 28, 2019

RangerMauve commented Mar 19, 2019

martinheidegger commented Mar 19, 2019 • edited Loading

martinheidegger commented Mar 19, 2019 • edited Loading

RangerMauve commented Mar 19, 2019

joehand commented Mar 19, 2019

joehand commented Feb 6, 2018 •

edited

Loading

joehand commented Feb 6, 2018 •

edited

Loading

aral commented Dec 14, 2018 •

edited

Loading

yoshuawuyts commented Dec 18, 2018 •

edited

Loading

martinheidegger commented Mar 19, 2019 •

edited

Loading

martinheidegger commented Mar 19, 2019 •

edited

Loading