Show more

I have been doing a bit of mapping there and I like the feeling of digital rambling it gives… it's a fun way to travel :)

Show thread

Happy to be doing an OpenRefine workshop at SWIB (Semantic Web in Libraries) in November. Including reconciliation with Wikidata and GND, publishing data on Wikidata and mapping & exporting data as RDF. Very excited!
---
RT @swibcon@twitter.com
Registration for #swib19 is open: swib.org/swib19/registration.h Take a look at this year's programme: swib.org/swib19/programme.html
twitter.com/swibcon/status/115

In so many cases the maintainer has moved on and is no longer interested. We really need mechanisms to make project ownership more fluid. GitHub's notion of fork does not solve the problem, although it is a good first step: ownership of the package on PyPI / npm / other platforms is the key.

Show thread

How wonderful is it when you make a PR to an upstream library and it gets reviewed, merged and released in a timely manner? With constructive comments and improvements from the maintainer? It is so rare!

I'm looking for tooling (#Bash #Python #OpenRefine etc.) that processes Excel-formatted exports from Web of Science or similar into #DSpace-compatible CSVs.

Processing steps include: Renaming & subsetting columns, renaming to match #DublinCore fields, harmonising author names, retrieving missing info via #DOI-APIs etc.

Would be a neat programming lessons, but I'm looking for tried-and-tested code for production.

Please boost! 1k thanks for any hints 🙂 #Code4Lib #DataScience #LibraryCarpentry

Learning about and . Pretty mind blowing! I think rewriting 's GREL with Truffle would be a good first exercise.

Très bon papier dans le dernier @mdiplo sur la dématérialisation, l’illectronisme, et les services publics monde-diplomatique.fr/2019/08/

RT twitter.com/nitot
« Ajouter des nouvelles voies sur la route pour résoudre les embouteillages, c’est comme défaire se ceinture pour résoudre l'obésité » — Lewis Mumford, 1955 twitter.com/BrentToderian/stat

RT @datagouvfr@twitter.com

Sortie en beta test de csv-gg. Ce nouvel outil vous permet de créer un fichier CSV valide selon les spécifications d’un schéma référencé sur schema.data.gouv.fr. Retours utilisateurs bienvenus ! On prend aussi les félicitations sur le logo 😇

➡️ csv-gg.etalab.studio

🐦🔗: twitter.com/datagouvfr/status/

Presenting a model of workflows at the Applied Category Theory conference on Friday. It gives a 3D representation of a composition of operations and facets:
arxiv.org/pdf/1906.05937.pdf

@Phyks tu sais si brouter prend en compte les restrictions de bifurcation ?

"A survey of OpenRefine reconciliation services" by @pintoch: arxiv.org/abs/1906.08092

With first considerations of a reconciliation API that lets clients do the global scoring in a flexible manner based on servers providing field-level scores. Also mentioning lobid-gnd reconciliation service. ;-)

effectivethesis.com/project/

> Too many students write theses that achieve nothing other than meet the requirement for the degree they are seeking. When there are so many important questions, in a wide range of fields, that need to be answered, that's a terrible waste of time and intelligent, creative energy.

#EffectiveAltruism

Interested in entity on the web? Then join the Entity Community Group!
w3.org/community/blog/2019/06/

Starting with the reconciliation , we will document, improve and hopefully standardize a protocol to do entity matching at scale on the web.

Show more
La Quadrature du Net - Mastodon - Media Fédéré

The social network of the future: No ads, no corporate surveillance, ethical design, and decentralization! Own your data with Mastodon!