Skip to content

Latest commit

 

History

History
164 lines (112 loc) · 10.2 KB

File metadata and controls

164 lines (112 loc) · 10.2 KB
<< Back to the homepage of this tool >> To the Github repo of this page

Technical notes (under construction)

Latest update: 16 September 2024

This page gives more info about

  1. The structure of the GLAMorousToHTML repository, its files and folders
  2. Short description of their functions (see the docstrings for more detailed functional descriptions)
  3. How to run this repo yourself
  4. Change log
  5. Features to be added

Repository structure and functional descriptions

What are the main files and folders in this repo, and what do they do?

Main folder

category_logo_dict.json category_logo_dict_nde.json

build_html.py

build_excel.py

analytics.py

wikidata_cache.json

README.md - this file

pagetemplate.html

GLAMorous_MediacontributedbyKoninklijkeBibliotheek_Wikipedia_Mainnamespace_10012024.html

Subfolders


Running the scripts yourself

To follow..


Change log (needs updating)

xx April 2024

  • Reports

14 March 2024

  • Included reports for 14 institutions from Australia and New Zealand.

29 February 2024

  • Included reports for institutions from Norway, Sweden, Finland and Sweden.
  • README.md: Added explanations how you can run the script yourself.

14 February 2024

  • Refactored all code into multiple separated modules: setup.py, general.py, buildHTML.py and buildExcel.py. This has reduced the complexity of the main script GLAMorousToHTML.py significantly and made the total suite of code much more modular and easier to understand, maintain and expand.
  • Moved all HTML report pages into a separate site/ folder. This has made the repo much cleaner, clearer and more maintainable.
  • Created five HTML files that redirect the old KB HTML pages (from 27-01-2022 to 16-01-2024) to the new equivalent ones in "/site" folder. Did not implement redirection for other institutions.
  • Per 14-02-2024 added Excel outputs in data/ folder, to be used as structured input for data applications, such as OpenRefine
  • In the proces of updating the data structure in category_logo_dict.json, where the new structure can be seen under the 'Netherlands' key.
  • Improved pagetemplate.html to be key based ({numarticles} Wikipedia articles) rather than index based ({0} Wikipedia articles)

Features to add