{"id":139,"date":"2009-12-03T19:15:56","date_gmt":"2009-12-03T18:15:56","guid":{"rendered":"http:\/\/blogs.ukoln.ac.uk\/ukolndev\/?p=139"},"modified":"2013-06-08T13:45:37","modified_gmt":"2013-06-08T13:45:37","slug":"writeslike-us-wins-and-fails","status":"publish","type":"post","link":"https:\/\/www.emmatonkin.com\/ukolndev\/2009\/12\/03\/writeslike-us-wins-and-fails\/","title":{"rendered":"writeslike.us: Wins and Fails"},"content":{"rendered":"<p>Wins:<br \/>\n\u27a2 Getting information such as institution names\/URLs from Wikipedia, and widespread use of available web services in general<br \/>\n\u27a2 Extracting names from OAI-DC was easier than expected &#8211; although there are still issues with identifying name pair order.<br \/>\n\u27a2 Evidence based learning methods can be applied successfully to the data retrieved to enhance it &#8211; getting into FixRep territory. The project has been very useful for the purpose of establishing further use cases for &#8216;cleaning up&#8217; metadata.<br \/>\n\u27a2 Some interesting work in name \/ identity disambiguation through statistical clustering analysis. We&#8217;re looking at linking extracted info together with formal information such as that made available by the NAMES project.<br \/>\n\u27a2 Storyboards defining the workflow of the system form an effective part of the agile development process, and were very useful for us.<br \/>\n\u27a2 Using an SQL database as the repository was effective once problems with slow queries were addressed by normalizing data, reviewing the schema design, and adding indexes as necessary.<\/p>\n<p>Fails:<br \/>\n\u27a2 Natural Language Toolkit &#8211; we didn&#8217;t use it for its original purpose. Instead, we went back to TreeTagger, although this was not specifically trained for the sort of technical document we were analysing.<br \/>\n\u27a2 The text analysis expertise required for this project wasn&#8217;t already present in the team. 
It would&#8217;ve been a good idea to arrange training for the team to make sure we were all on the same page!<br \/>\n\u27a2 Ensure all related documents, URIs, etc., are contained\/linked in the project wiki.<br \/>\n\u27a2 Cultural mismatch between research approach to defining requirements\/expectations and development requirements\/expectations. e.g. who writes the formal requirements document?<br \/>\n\u27a2 Earlier storyboard scenario development would have been helpful, so a good lesson for next time.<br \/>\n\u27a2 Swine flu and its effects were quite severe on this project &#8211; our Portuguese collaborators were unavailable for quite some time due to a) the danger of traveling to the UK and contracting the virus, and b) (after subsequently contracting the illness in Portugal) the effects of the illness itself!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Wins: \u27a2 Getting information such as institution names\/URLs from Wikipedia, and widespread use of available web services in general \u27a2 Extracting names from OAI-DC was easier than expected &#8211; although there are still issues with identifying name pair order. 
\u27a2 Evidence based learning methods can be applied successfully to the data retrieved to enhance it [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":[],"categories":[10,18],"tags":[47,170,60,171,77,83,97,174],"_links":{"self":[{"href":"https:\/\/www.emmatonkin.com\/ukolndev\/wp-json\/wp\/v2\/posts\/139"}],"collection":[{"href":"https:\/\/www.emmatonkin.com\/ukolndev\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.emmatonkin.com\/ukolndev\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.emmatonkin.com\/ukolndev\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/www.emmatonkin.com\/ukolndev\/wp-json\/wp\/v2\/comments?post=139"}],"version-history":[{"count":2,"href":"https:\/\/www.emmatonkin.com\/ukolndev\/wp-json\/wp\/v2\/posts\/139\/revisions"}],"predecessor-version":[{"id":1872,"href":"https:\/\/www.emmatonkin.com\/ukolndev\/wp-json\/wp\/v2\/posts\/139\/revisions\/1872"}],"wp:attachment":[{"href":"https:\/\/www.emmatonkin.com\/ukolndev\/wp-json\/wp\/v2\/media?parent=139"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.emmatonkin.com\/ukolndev\/wp-json\/wp\/v2\/categories?post=139"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.emmatonkin.com\/ukolndev\/wp-json\/wp\/v2\/tags?post=139"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}