Skip to content

0.8.1

Latest
Compare
Choose a tag to compare
@lfoppiano lfoppiano released this 14 Sep 08:29
· 33 commits to master since this release

Added

  • Identified URLs are now added in the TEI output #1099
  • Added DL models for patent processing #1082
  • Copyrights owner and licenses identification models #1078
  • Add research infrastructure recognition for funding processing #1085
  • Add paragraphs coordinates in the TEI output #1068
  • Specify configuration file with DL models enabled for the full docker image #1117
  • Support for biblio-glutton 0.3 #1086

Changed

  • Update affiliation process #1069
  • Improved the recognition of URLs using (when available) PDF annotations, such as clickable links
  • Updated TEI schema #1084
  • Review patent process #1082
  • Add Kotlin language to support development and testing #1096

Fixed

  • Avoid splitting URLs between sentences #1097
  • Add missing sentence segmentation in funding and acknowledgement #1106
  • Docker image was optimized to reduce the needed space #1088
  • Fixed OOBE when processing large quantities of notes #1075
  • Corrected <title> coordinate attribute name #1070
  • Fix missing coordinates in paragraph continuation #1076
  • Fixed JSON log output
  • Fixed notes identification #1124
  • Fixed extraneous semicolon in the training data #1133
  • Reduced security vulnerabilities in the dependencies #1136 #1137

New Contributors