Visualising China launched

Visualising China was launched yesterday at the “Treaty Ports in Modern China” conference held at the University of Bristol. The conference is organised by the Department of Historical Studies as part of an Economic and Social Research Council-funded project entitled “Colonialism in Comparative Perspective: Tianjin under Nine Flags, 1860-1949“.

The site now allows users to explore more than 8000 digitised images of historical photographs of China taken between 1850 and 1950. It cross-searches metadata from Historical Photographs of China, the Sir Robert Hart Collection and Joseph Needham’s Photographs of Wartime China. In addition we also include resources chosen from Google Books by subject experts. Users are encouraged to annotate existing data with suggestions and in the future we plan to offer researchers and others the ability to “add” other relevant resources by incorporating moderated online content which can then be browsed and searched through this single point of access (for example images on Picasa, Flickr, Internet Archive, personal libraries of related Google books, videos from YouTube or online journals).

Posted in Digitisation | Leave a comment

Important new content and a pre-release peek at the website

Lots of news to report and lots of work still going on as we near the end of the project (end of March) and look forward to user interaction with the Visualising China website … We’re delighted that over 1700 new images have recently been added and are now searchable via Visualising China. These include the Sir Robert Hart collection of photographs and slides from Queen’s University Belfast and the Joseph Needham Photographs of Wartime China, 1942-1946 from the Needham Research Institute in Cambridge. These very important collections have both broadened and deepened the content reachable through Visualising China and we look forward to announcing more such collaborations in the near future. Alongside these structured and scholarly resources, we continue to add individual albums and photos, from trunks and attics as well as from the Internet.

We have also made a pre-release alpha version of Visualising China available for all friends of the project to look at – to see what we have working on for the last few months. The release version of the website is planned for the end of March, so there are still a lot of things to bring in to it and several major issues and bugs that we know about, but do have a look at it and see how things have progressed since the early prototypes.

http://visualisingchina.net/

Background on the Hart collection:
http://digitalcollections.qub.ac.uk/digital-image-gallery/hart/
Background on the Needham collection:
http://www.nri.org.uk/JN_wartime_photos/home.htm

Posted in Digitisation | Tagged , , | Leave a comment

Technical workshop on Cross-searching digital collections

On the 21st September, Visualising China held a very useful workshop in Bristol, looking at the problems of cross-searching multiple remote collections by providing users with a search over integrated data, harvested periodically from target sites/services.

The workshop aimed to enable shared understanding and expertise as opposed to attempting to define a wholesale solution to the myriad problems surrounding cross-searching. Problems range from how to do the ranking of search results to how to sustain a cross-search service when searchable endpoints inevitably change their delivery mechanisms over time.

Presentations were offered from SOAS, the Connected Histories project, Visualising China and Edina’s Aggregations Scoping Study.

Summary of Presentations

Visualising China – described how we in all cases harvest and store metadata, as opposed to storing binary image files or texts. We use a Linked Data/Semantic Web solution to harvest, store and index:
1. A Java client library to the Google Books Libraries API to support building RDF from that datasource,
2. Query via SQL on the Image collection metadata from the server in France (where are hosted our Historical Photographs of China), with data transformed to RDF for the backend, and
3. We similarly XSLT the Belfast XML metadata (Sir Robert Hart image collection) to RDF.
We use a homegrown solution to cross-query across multiple SPARQL endpoints – Arnos, software we have developed for this and other projects. Further issues discussed here included, how to auto-generate/capture geo-locations, timelines and controlled vocabularies across disparate content, people/resource identity matching across interrelated collections, index coverage to support the appropriate level of user searching, searching user generated content and the scalability of the Arnos solution.

Connected Histories – described how they periodically harvest and create search indexes (using Lucene) on content collections – primarily Old Bailey proceedings online. They are also doing automated entity recognition technique via NLP on some of the unstructured OCR-derived data. Their federated search facility has been upped from 9 datasets to now cover 14. A key issue here then is scalability – and they doubt that using RDF/Semantic Web will give them a scalable solution. Other issues include negotiating lengthy license agreements with commercial providers, and the time consuming process involved ‘understanding’ the nature of the metadata/content from each different provider.

Workshop corridor chatSOAS – Theirs is a joint project with Yale University, connecting Arabic resources across the two parties. The mapping of metadata schemes to provide an integrated search has proved difficult, as the interpretation of common standards (such as Dublin Core) has not been applied in exactly the same way by both parties. Malcolm Raggett talked in depth about the importance of providing solutions that are fully appropriate to an end-user and what the technical implications of these are.

EDINA scoping study – The Research Discovery Taskforce vision is to have a “collaborative, aggregated and integrated research discovery and delivery framework”… So this study looked at the sub area of this: Aggregations of Metadata about Images and Time based Media. The study will release its final report in the coming weeks, and Sheila Fraser gave an indication of what it will reveal. She discussed five different aggregator models they came across in the study (through surveys and targeted interviews). Issues and findings discussed here included: aggregations tailored to subject disciplines may well be the best approach, complex multimedia metadata is still needed (in lieu of improvements in image recognition for example), common metadata schemas do not prevail therefore there’s the suggestion that use of a hybrid aggregator model – combining with a Linked Data approach – could be the most workable solution, user tagging usefully enriches metadata, aggregations should be made available to other aggregations and user centred design and focus is essential in this area. Sheila also demo’d the VSM Portal Demonstrator (http://edina.ac.uk/projects/vsmportal/).

Feedback from participants after the workshop was very positive. Participants are all keen to stay in touch to share reports and experiences. Many thanks to all who came!

Posted in Digitisation | Leave a comment

Report from User Engagement workshop

On 7th September we held a successful user engagement workshop to coincide with the British Association for Chinese Studies annual conference.
attendees at Visualising China user engagement workshop
The workshop was attended by eminent academics, postgraduates and researchers from half a dozen different countries and was extremely valuable for the project team. We were able to discuss the scenarios in which our participants looked for photographs to support their work and were also able to show them an early prototype of the planned Visualising China tool and to hear their comments and reactions. After the workshop, many of the attendees stayed on for a meal together which proved an entertaining social experience as well as providing valuable and fascinating professional contacts. Thank you to all the participants for giving us the benefit of their expertise and opinions.

The next project event will be more technically focussed. More news later.

Posted in Digitisation | Tagged | Leave a comment

New Project Landing Page and User Engagement Plans

This month we are launching the project “landing page”, designed to help people quickly understand the aims of the project, to sign up for project updates, and to offer input if they wish to. It is primarily aimed at researchers of modern Chinese History (or British in China history) as key stakeholders in the Visualising China demonstrator tool we are building.

Visualising China Landing Page Screen Shot

We are hosting a workshop in Bristol on 7th September to find out more about researcher needs and workflows in this area, and to show and discuss the latest version of the tool. This is co-located with the 2010 Conference of the British Association for Chinese Studies, also being held here in Bristol from the 8th September, so we hope to attract some valuable participants in order to inform further development of our software.

Posted in user engagement | Leave a comment

Plans to Cross Search Related Content

We held another very productive Steering Committee meeting last month with Malcolm Raggett joining us from SOAS. Malcolm gave really useful input drawn from SOAS experiences in offering content collections online and issues such as problems with the scalability of cross-searching and the limitations of and options for 2D information visualisation.

We have also been having discussions with Queens Belfast regarding their excellent, online Sir Robert Hart collection. Belfast Sir Robert Hart Collection - sample record
Work on cross-searching our photographs of China collection with theirs will start shortly. We will also hold a cross-search themed workshop for technical developers in the next few months and will be seeking interest from across the JISC e-content programme and beyond.

Finally, user engagement is ready to start in earnest as we are about to launch a new “landing page” for researchers to register interest in our project. Further user engagement opportunities are planned around a workshop we will run in the Autumn and the BACS (British Association for Chinese Studies) conference to be held here in Bristol in early September.

Posted in cross-searching | Tagged | Leave a comment

Visualising China Demo at JISC Conference 2010

Earlier this month Simon Price and Nikki Rogers attended the JISC Conference 2010 in London. Nikki, the Visualising China project manager, was an invited panel member on a discussion session about maximising creativity of software developers and lessons learnt from earlier ILRT projects under JISC Rapid Innovations programme. Simon, one of the Visualising China developers, gave a demonstration presentation of the latest release of the software and a sneak preview of the forthcoming release.

Virtual Earth Map View

The demo gave a whirlwind tour of the existing image cross-searching and annotation features described in earlier posts on this blog, before moving on to briefly look at the new capability allowing users to link external resources to the existing knowledge base, for example, adding a flickr image as a referable, annotatable, searchable and visible item. Finally the new, soon to be released, map view was shown. The map view overlays both geographically located items, such as photos of a location or people or events, on top of a selection of different map backgrounds. Backgrounds include third-party maps Microsoft (Bing) Virtual Earth, Google Maps and Yahoo Maps as well digitised renditions of historical maps of China.

Old Map

Posted in Image Annotation, Social Software and Web 2.0, Visualisation | Tagged , | Leave a comment

Linking the Knowledge Base to External Content

The Visualising China demo now links to all the images, plus some metadata, in the Historical Photographs of China collection (which was first put online at http://chp.ish-lyon.cnrs.fr/).

One of the goals in this project is to allow end-users (researchers and so on) to be able to effectively add new, online images into the system’s “knowledge base” simply by entering their URL. We have implemented a version of this.

Example: Suppose I am researching the Boxer uprising in China 1900-1 and have found an image on Flickr of particular interest due to its depiction of some Australian contingents sent at that time to China. I want to be able to annotate and link this picture into other resources I have already linked via the Visualising China (these would perhaps be photographs already in the Historical Photographs of China, a YouTube video, and a newspaper article elsewhere online). How do I do this? – using “Add New Resource” (I must be logged in to do this – currently supported by OpenId login):

Clicking to Add New Resource

I paste the URL of the flickr image (in this case “http://farm3.static.flickr.com/2025/1689131855_83111deda9.jpg”) into the form and add any other details into the text boxes provided. If I save this newly added resource it will now appear as part of the knowledge base in Visualising China. If I now search on “NSW” (part of the title I gave the new resource) it will appear as a result hit:

Viewing the newly added resource in Visualising China

.

Posted in Digitisation | Tagged | Leave a comment

Developing the branding concept for Visualising China

To complement the project developer team we have web designer Ben Hayes on board to help us develop our designs for the Visualising China tool. Stuart Church, our user experience consultant, gave invaluable help earlier in the project to develop our description of Visualising China’s target audience. Our primary target audience is UK academic researchers, with several other secondary audiences, the main ones of these being UK learners (in schools, colleges or Universities) and members of the public – in particular genealogists.

Using knowledge of who the intended target audience are, we then held a consultation meeting with stakeholders, facilitated by Ben who has come up with the first general design for Visualising China:

Visualising China Design Mock Up

Mockup of the new overall design for Visualising China

Ben writes “I took Andy Degg’s excellent original design as a starting point and tried to give it a slightly more modern, high-tech feel, since the new project is more of an interactive tool, rather than an exhibition. The result is a balance between the historical nature of the subject matter, and the fresh, modern feel of a web application”.

Ben emphasises that this is not meant to be a complete visual design for the web application, but is to assist us in working on the branding for the Visualising China tool. Eventually the finalised visual design may have a different layout to this.

Great work Ben, thanks! Now over to the developers to implement it …

Posted in Design | Tagged | Leave a comment

Early Demo and Discussion with Steering Committee Members

An excellent meeting took place yesterday, bringing together key players in the Visualising China project, including our mentor, Grant Young from the University of Cambridge. Grant is providing a very useful “critical friend” role on the project.

Before Christmas the development team acquired full access to the Historical Photographs of China collection stored on a French partner’s server and at yesterday’s meeting Jasper Tredgold gave an early demo of what our development team have achieved so far, having harvested metadata plus image links for the collection and importing it into the software developed in our predecessor JISC-funded STARS project.

Visualising China demo screen shot showing list of search results and a record about Lily Ho Screen shot from Visualising China demo showing map view and Lily Ho record

The demo prompted lots of useful discussion about how users of the website are able to annotate “resources” and link them – semantically and visually – to other resources in the “knowledge base”.

We discussed issues around the linking to third party sources (such as Flickr and You Tube) that Visualising China usefully offers, but which could also raise political sensitivity concerns, potential levels of authority and control over the social software side of Visualising China and possibilities for the ranking and filtering of social annotations according to knowledge about the author who contributed them. The software will continue to be developed to support image annotation and timeline and historical map views of content, as well as integration of more third-party content as time goes on.

Some more excellent discussion took place with our User Experience consultant, who is leading the way we clearly articulate our goals and primary audience for Visualising China at this stage, and also with our Large Data Storage people at the University of Bristol, who are kindly working with us with a view to sustainability beyond the project’s end.

Posted in Image Annotation, Social Software and Web 2.0, Visualisation | Tagged | Leave a comment