Finish week 23 fri
akirataguchi115 committed Jun 7, 2024
1 parent 7219932 commit b5ac19d
Showing 5 changed files with 59 additions and 6 deletions.
2 changes: 2 additions & 0 deletions Ch.00_Abstract.tex
@@ -27,4 +27,6 @@
\section*{Acknowledgements}
much love to suvi, artemis, sami nurmivaara, prof männistö and prof mäntylä

thanks to def for lending me gpt4. thanks to rashid and barunes for sending me software licensing related videos and news.

dedicated to suvi <3
15 changes: 13 additions & 2 deletions Ch.20_Methods.tex
@@ -79,7 +79,9 @@ \section{Search process}
\end{figure}


In the first stage, the search was conducted using the ``SPDX License List'' \citep{spdx:licenses}, ``The DFSG and Software Licenses'' \citep{debian:dfsg}, FSF's ``Category:License'' Wiki page \citep{fsf:licenselist}, GNU's ``Various Licenses and Comments about Them'' \citep{gnu:licenselist} and ``OSI Approved licenses'' \citep{osi:licenselist}. The sources were processed in the following order: SPDX, DFSG, FSF, OSI and GNU. The appendix was compiled in spreadsheet software so that only the first source in this order in which a PCL was found is documented. For example, even if the MIT license were found in both SPDX and DFSG, \hyperref[appendix:a]{Appendix A} would list the MIT license only once, with the ``First hit from'' value being SPDX. The initial list of 789 PCLs, excluding duplicates, is provided in \hyperref[appendix:a]{Appendix A}.
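
The first-hit bookkeeping described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the spreadsheet procedure that was actually used: the function name \texttt{merge\_first\_hit}, the dictionary layout and the sample identifiers per source are all hypothetical.

```python
# Source priority as stated in the text: SPDX, DFSG, FSF, OSI, GNU.
SOURCE_ORDER = ["SPDX", "DFSG", "FSF", "OSI", "GNU"]


def merge_first_hit(hits_by_source):
    """Map each identifier to the first source (in SOURCE_ORDER) that lists it."""
    first_hit = {}
    for source in SOURCE_ORDER:
        for identifier in hits_by_source.get(source, []):
            # setdefault keeps the earliest source and ignores later duplicates.
            first_hit.setdefault(identifier, source)
    return first_hit


# Hypothetical sample data, not the real search results.
sample_hits = {
    "SPDX": ["MIT", "0BSD"],
    "DFSG": ["MIT", "AGPL-3.0-only"],
    "OSI": ["0BSD"],
}

for identifier, source in merge_first_hit(sample_hits).items():
    print(f"{identifier}: first hit from {source}")
```

In this sketch a license that occurs in several sources is kept once, and its recorded source is the earliest one in the fixed order, which mirrors the ``First hit from'' column.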

Two remarks on the first stage are in order. First, the FSF listing included a ``license'' named ``other''. At the time of observation, this entry covered 5282 programs known to the FSF whose PCLs the FSF had not yet documented. Although some of these programs had straightforward PCLs such as GPL-2.0-only, we decided to leave them out of the scope of this thesis due to the large number of programs. Second, because GNU's list was the most difficult source from which to scrape identifiers automatically, we limited its PCLs to the ``Software Licenses'' section as defined by the table of contents on the website.
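
Restricting an automatic scrape to one section of a page could look roughly like the sketch below. It is only an illustration: the HTML fragment in \texttt{SAMPLE\_HTML}, its heading ids and the function name \texttt{software\_license\_ids} are assumptions, and the markup of the real website may differ.

```python
import re

# Hypothetical HTML fragment; the real page markup may differ.
SAMPLE_HTML = """
<h3 id="SoftwareLicenses">Software Licenses</h3>
<dt><a id="AGPL" href="/licenses/agpl.html">GNU Affero General Public License</a></dt>
<dt><a id="AcademicFreeLicense" href="/afl.html">Academic Free License</a></dt>
<h3 id="DocumentationLicenses">Licenses for Documentation</h3>
<dt><a id="FDL" href="/licenses/fdl.html">GNU Free Documentation License</a></dt>
"""


def software_license_ids(html):
    """Return anchor ids found between the software heading and the next <h3>."""
    # Lazy match from the (assumed) software heading up to the next heading or end of text.
    section = re.search(r'<h3 id="SoftwareLicenses">.*?(?=<h3 |\Z)', html, re.S)
    if section is None:
        return []
    # Each license entry is assumed to start with <dt><a id="...">.
    return re.findall(r'<dt><a id="([^"]+)"', section.group(0))


print(software_license_ids(SAMPLE_HTML))
```

A regular expression is brittle against markup changes, so an HTML parser would be the more robust choice for anything beyond a one-off extraction.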

In the second stage, the inclusion and exclusion criteria were applied to further filter the literature and reduce the number of licenses to be reviewed. This involved a manual review of the full licenses. The reason for each exclusion is provided as a shortcode (e.g. I1 = failed to meet inclusion criterion 1, E2 = met exclusion criterion 2) in \hyperref[appendix:b]{Appendix B}.

@@ -135,4 +137,13 @@ \section{Data collection and data analysis}

\textcolor{orange}{week 22 mon: we are not going to include fsf:license:other. for example babl seems to be in that category but is actually licensed under gplv3.}

\textcolor{orange}{week 22 tue: definitely not including fsf:license:other. there are a whopping 5282 programs whose license fsf just hasn't put up yet. i don't know what im going to do when i have to review the full licenses. for stage 3 licenses i should create a spreadsheet that doesn't allow duplicate attribute-having licenses. ok do stage inside spreadsheet as well and so that there is a simple clickable url to the license text. wayback machine in the end all of the urls via api or something.}

\textcolor{orange}{week 23 mon: only software applicable licenses were chosen from gnu licenses since that's the only place that already categorized them and the only place where it is incredibly difficult to pick out the license identifiers without manually writing them down}

\textcolor{orange}{week 23 wed: choosealicense appendix has "duplicates" but ig for stage 3 i could do the same kinda thing. maybe even justify with the appendix why agplv3re is the best license. i should make the stage 1 table so that it would have every place where the license was found from.}

\textcolor{orange}{week 23 thu: noo the appendix might not work. like where does json license stand in.}

remove the dates from this thingy and remove all the unnecessary thinking

44 changes: 43 additions & 1 deletion Ch.90_Appendix_1.tex
@@ -12,4 +12,46 @@
\caption{A list of literature and the basic filtering step.}
\label{table:appendix:a}
\end{center}
\end{table}

\begin{table}[]
\begin{tabular}{c|c|c|c|c|c|c}
\textbf{Literature Id} & \textbf{Identifier} & \textbf{SPDX} & \textbf{DFSG} & \textbf{FSF} & \textbf{OSI} & \textbf{GNU} \\
L1 & 0BSD & Found in & & & Found in & \\
L2 & 996 & & & Found in & & \\
L3 & AAL & Found in & & & Found in & \\
L4 & Abstyles & Found in & & & & \\
L5 & AcademicFreeLicense & & & & & Found in \\
L6 & ACDL-1.0 & & & Found in & & \\
L7 & ACEL & & & Found in & & \\
L8 & AdaCore-doc & Found in & & & & \\
L9 & Adobe-2006 & Found in & & & & \\
L10 & Adobe-Display-PostScript & Found in & & & & \\
L11 & Adobe-Glyph & Found in & & & & \\
L12 & Adobe-Utopia & Found in & & & & \\
L13 & ADSL & Found in & & & & \\
L14 & AFL-1.1 & Found in & & & & \\
L15 & AFL-1.2 & Found in & & & & \\
L16 & AFL-2.0 & Found in & & & & \\
L17 & AFL-2.1 & Found in & & & & \\
L18 & AFL-3.0 & Found in & & Found in & Found in & \\
L19 & Afmparse & Found in & & & & \\
L20 & AGPL & & & & & Found in \\
L21 & AGPL-1.0-only & Found in & & Found in & & \\
L22 & AGPL-1.0-or-later & Found in & & Found in & & \\
L23 & AGPL-3.0-only & Found in & Found in & Found in & Found in & \\
L24 & AGPL-3.0-or-later & Found in & & Found in & & \\
L25 & Aladdin & Found in & & Found in & & Found in \\
L26 & Aladdin-9 & & & Found in & & \\
L27 & AMDPLPA & Found in & & & & \\
L28 & AML & Found in & & & & \\
L29 & AML-glslang & Found in & & & & \\
L30 & AMPAS & Found in & & & & \\
L31 & ANTI-1.3 & & & Found in & & \\
L32 & ANTI-1.4 & & & Found in & & \\
L33 & anticapitalist & & & & & Found in \\
L34 & ANTLR-PD & Found in & & & & \\
L35 & ANTLR-PD-fallback & Found in & & & & \\
L36 & Apache-1.0 & Found in & & Found in & & \\
\end{tabular}
\end{table}
Binary file modified HY-CS-main.pdf
Binary file not shown.
4 changes: 1 addition & 3 deletions README.md
@@ -5,11 +5,9 @@ Repository for my MSc thesis

`latexmk`
## Timing
- gather identifiers from gnu using regexr

week 22 stage 1

week 23 stage 2
week 23 stage 1

week 24 stage 2
