<div dir="ltr">** Apologies for cross-posting **<div><br></div><div><span id="gmail-docs-internal-guid-8d24db9c-7fff-f7e6-e7b9-a8acac333e3d"><p dir="ltr" style="line-height:1.38;text-align:center;margin-top:0pt;margin-bottom:0pt"><span style="font-size:14pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">2nd Call for Participation:</span></p><p dir="ltr" style="line-height:1.38;text-align:center;margin-top:0pt;margin-bottom:0pt"><span style="font-size:14pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">Mining the Web of HTML-embedded Product Data</span></p><p dir="ltr" style="line-height:1.38;text-align:center;margin-top:0pt;margin-bottom:0pt"><span style="font-size:14pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">(co-located with ISWC2020)</span></p><br><p dir="ltr" style="line-height:1.38;text-align:center;margin-top:0pt;margin-bottom:0pt"><span style="font-size:12pt;font-family:Arial;color:rgb(255,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">(In line with ISWC2020, this event will be held online. Prizes remain to be won!)</span></p><br><p dir="ltr" style="line-height:1.38"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">1. 
Overview</span></p><p dir="ltr" style="line-height:1.38"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">The Semantic Web Challenge on Mining the Web of HTML-embedded Product Data is co-located with the 19th International Semantic Web Conference (</span><a href="https://iswc2020.semanticweb.org/" style="text-decoration-line:none"><span style="font-size:11pt;font-family:Arial;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;text-decoration-line:underline;vertical-align:baseline;white-space:pre-wrap">https://iswc2020.semanticweb.org/</span></a><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">, 2-6 Nov 2020 in Athens, Greece). The challenge organises two shared tasks related to product data mining on the Web: (1) product matching and (2) product classification. This event is organised by The University of Sheffield, The University of Mannheim and Amazon, and is open to anyone. Teams whose systems beat the baseline of the respective task will be invited to write a paper describing their method and system and to present the method as a poster (and potentially also a short talk) at the ISWC2020 conference. 
</span><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">Winners of each task will be awarded a prize of 500 euro</span><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap"> (partly sponsored by Peak Indicators, </span><a href="https://www.peakindicators.com/" style="text-decoration-line:none"><span style="font-size:11pt;font-family:Arial;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;text-decoration-line:underline;vertical-align:baseline;white-space:pre-wrap">https://www.peakindicators.com/</span></a><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">).</span></p><br><p dir="ltr" style="line-height:1.38"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">2. 
Challenge website</span></p><p dir="ltr" style="line-height:1.38"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">For details of the challenge, please visit </span><a href="https://ir-ischool-uos.github.io/mwpd/" style="text-decoration-line:none"><span style="font-size:11pt;font-family:Arial;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;text-decoration-line:underline;vertical-align:baseline;white-space:pre-wrap">https://ir-ischool-uos.github.io/mwpd/</span></a></p><br><p dir="ltr" style="line-height:1.38"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">3. Important dates</span></p><p dir="ltr" style="line-height:1.38"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;text-decoration-line:line-through;vertical-align:baseline;white-space:pre-wrap">02 March 2020: Google support group open. 
Please join the group at </span><a href="https://groups.google.com/forum/#!forum/mwpd2020" style="text-decoration-line:none"><span style="font-size:11pt;font-family:Arial;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;text-decoration-line:underline line-through;vertical-align:baseline;white-space:pre-wrap">https://groups.google.com/forum/#!forum/mwpd2020</span></a><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;text-decoration-line:line-through;vertical-align:baseline;white-space:pre-wrap"> if you wish to take part in this event</span></p><p dir="ltr" style="line-height:1.38"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;text-decoration-line:line-through;vertical-align:baseline;white-space:pre-wrap">16 March 2020: Release of the training and validation sets </span><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">(Training datasets have been released!)</span></p><p dir="ltr" style="line-height:1.38"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">01 June 2020: Release of the test set (without ground truth) </span></p><p dir="ltr" style="line-height:1.38"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">15 June 2020: Submission of system output </span></p><p dir="ltr" style="line-height:1.38"><span 
style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">08 July 2020: Publication of system results and notification of acceptance for presentation</span></p><br><p dir="ltr" style="line-height:1.2"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">4. Task and dataset brief</span></p><p dir="ltr" style="line-height:1.2"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">The challenge organises two tasks: product matching and product classification.</span></p><br><p dir="ltr" style="line-height:1.2"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">i) Product matching</span><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap"> deals with identifying product offers on different websites that refer to the same real-world product (e.g., the same iPhone X model offered under different names/offer titles and with different descriptions on various websites). A corpus of 16M product offers, grouped into product offer clusters, is released for generating training data. A validation set containing 1.1K offer pairs and a test set of 600 offer pairs will also be released. 
The goal of this task is to classify each offer pair in these datasets as a match (i.e., both offers refer to the same product) or a non-match.</span></p><br><p dir="ltr" style="line-height:1.2"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">ii) Product classification</span><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap"> deals with assigning predefined product category labels (which can span multiple levels) to product instances (e.g., iPhone X is a ‘SmartPhone’, and also ‘Electronics’). A training dataset containing 10K product offers, a validation set of 3K product offers and a test set of 3K product offers will be released. Each dataset contains product offers with their metadata (e.g., name, description, URL) and three classification labels, each corresponding to a level in the GS1 Global Product Classification taxonomy. The goal is to classify these product offers into the predefined category labels. 
</span></p><br><p dir="ltr" style="line-height:1.2"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">All datasets are built based on structured data that was extracted from the Common Crawl (</span><a href="https://commoncrawl.org/" style="text-decoration-line:none"><span style="font-size:11pt;font-family:Arial;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;text-decoration-line:underline;vertical-align:baseline;white-space:pre-wrap">https://commoncrawl.org/</span></a><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">) by the Web Data Commons project (</span><a href="http://webdatacommons.org/" style="text-decoration-line:none"><span style="font-size:11pt;font-family:Arial;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;text-decoration-line:underline;vertical-align:baseline;white-space:pre-wrap">http://webdatacommons.org/</span></a><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">). </span></p><br><p dir="ltr" style="line-height:1.2"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">5. 
Resources and tools</span></p><p dir="ltr" style="line-height:1.2"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">The challenge will also release utility code (in Python) for processing the above datasets and scoring the system outputs. In addition, the following language resources for product-related data mining tasks will be released:</span></p><ul style="margin-top:0px;margin-bottom:0px"><li dir="ltr" style="list-style-type:disc;font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre"><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">A text corpus of 150 million product offer descriptions</span></p></li><li dir="ltr" style="list-style-type:disc;font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre"><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">Word embeddings trained on the above corpus</span></p></li></ul><h3 dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:4pt;padding:0pt 0pt 7.5pt"> </h3><h3 dir="ltr" style="line-height:1.2;padding:0pt 0pt 7.5pt"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">6. 
Organising committee</span></h3><ul style="margin-top:0px;margin-bottom:0px"><li dir="ltr" style="list-style-type:disc;font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre"><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">Dr Ziqi Zhang (Information School, The University of Sheffield)</span></p></li><li dir="ltr" style="list-style-type:disc;font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre"><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">Prof. 
Christian Bizer (Institute of Computer Science and Business Informatics, The University of Mannheim)</span></p></li><li dir="ltr" style="list-style-type:disc;font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre"><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">Dr Haiping Lu (Department of Computer Science, The University of Sheffield)</span></p></li><li dir="ltr" style="list-style-type:disc;font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre"><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">Dr Jun Ma (Amazon Inc., Seattle, US)</span></p></li><li dir="ltr" style="list-style-type:disc;font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre"><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">Prof. 
Paul Clough (Information School, The University of Sheffield & Peak Indicators)</span></p></li><li dir="ltr" style="list-style-type:disc;font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre"><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">Ms Anna Primpeli (Institute of Computer Science and Business Informatics, The University of Mannheim)</span></p></li><li dir="ltr" style="list-style-type:disc;font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre"><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:0pt"><span style="font-size:11pt;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">Mr Ralph Peeters (Institute of Computer Science and Business Informatics, The University of Mannheim)</span></p></li><li dir="ltr" style="list-style-type:disc;font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre"><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:12pt"><span style="font-size:11pt;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">Mr 
Abdulkareem Alqusair (Information School, The University of Sheffield)</span></p></li></ul><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:12pt"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-weight:700;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">7. Contact</span></p><p dir="ltr" style="line-height:1.2;margin-top:0pt;margin-bottom:12pt"><span style="font-size:11pt;font-family:Arial;color:rgb(0,0,0);background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;vertical-align:baseline;white-space:pre-wrap">To contact the organising committee please use the Google discussion group </span><a href="https://groups.google.com/forum/#!forum/mwpd2020" style="text-decoration-line:none"><span style="font-size:11pt;font-family:Arial;background-color:transparent;font-variant-numeric:normal;font-variant-east-asian:normal;text-decoration-line:underline;vertical-align:baseline;white-space:pre-wrap">https://groups.google.com/forum/#!forum/mwpd2020</span></a></p></span><br class="gmail-Apple-interchange-newline"><div><div dir="ltr" class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><div dir="ltr"><br><div><font color="#000000">Kind regards</font></div><div><font color="#000000">------------------</font></div><div><font color="#000000">Dr Ziqi Zhang</font></div><div><span style="color:rgb(0,0,0)">Lecturer in Social Media, Exams Officer</span></div><div><font color="#000000">Room 323a, <span style="font-size:12.8px">Regent Court, 211 Portobello, </span></font><span style="font-size:12.8px;color:rgb(0,0,0)">Information School, </span><span 
style="color:rgb(0,0,0);font-size:12.8px">University of Sheffield</span></div><div><font color="#000000"><span style="font-size:12.8px">Tel: +44 (0)114 222 2657</span></font><br></div><div><font color="#000000">Other information and forms of contact: my iSchool <a href="https://www.sheffield.ac.uk/is/staff/zhang" target="_blank">webpage</a>, <a href="https://ziqizhang.github.io/" target="_blank">personal website</a>, <a href="https://www.linkedin.com/in/ziqi-zhang-68109615/" target="_blank">LinkedIn</a>, <a href="https://twitter.com/ziqizhang_zz" target="_blank">Twitter</a>, <a href="https://orcid.org/0000-0002-8587-8618" target="_blank">ORCID</a>, <a href="https://scholar.google.co.uk/citations?user=VsRwsN8AAAAJ" target="_blank">Google Scholar</a></font></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div></div>