<div dir="ltr"><span style="font-size:10pt;font-family:Arial,sans-serif">Call for Demos: The First International Workshop on Big Data Discovery and Curation </span><span style="font-size:12pt;font-family:'Times New Roman',serif"><br>
<br></span><span style="font-size:10pt;font-family:Arial,sans-serif">Traditionally, data warehouses have been used to provide business users ways to consolidate information from different sources for analysis and reporting. For getting data ready for analysis, ETL (extract-transfrom-load) is used which involves reading data from different sources, cleaning the data, converting the format of the input data so that it conforms to the target database, and writing it to the target database. Big data paradigm is changing this problem due to three V’s: volume, velocity, and variety. In big data paradigm, potentially a large number of data sources and data assets are considered for analytics. One needs to discover, integrate, and analyze large volumes of diverse data quickly.</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br>
</span><span style="font-size:10pt;font-family:Arial,sans-serif">Finding relevant data for analytics is an important data discovery problem. Data diversity makes this problem difficult. The diversity of the data can be due to data model; type of data—structured, semi-structured, or unstructured; enterprise data vs. open public data; integrating social media data, etc. One also needs to handle data quality and data governance issues. In this workshop we invite demonstrations displaying techniques for identifying relevant sets of data, finding different kinds of relationships between structured, semi-structured, and unstructured data, curating the data for further analysis, integrating data using various join, union, and merge techniques, validating the integrated data, and analyzing it, from various industry domains. </span><span style="font-size:12pt;font-family:'Times New Roman',serif"><br>
</span><span style="font-size:10pt;font-family:Arial,sans-serif">Topics of interest include (but are not limited to):</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br></span><span style="font-size:10pt;font-family:Arial,sans-serif">o Cleaning big data</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br>
</span><span style="font-size:10pt;font-family:Arial,sans-serif">o Integration of big heterogeneous data</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br></span><span style="font-size:10pt;font-family:Arial,sans-serif">o Metadata extraction</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br>
</span><span style="font-size:10pt;font-family:Arial,sans-serif">o Automated rule generation</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br></span><span style="font-size:10pt;font-family:Arial,sans-serif">o Curating data</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br>
</span><span style="font-size:10pt;font-family:Arial,sans-serif">o Data discovery</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br></span><span style="font-size:10pt;font-family:Arial,sans-serif">o Provisioning and data lineage</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br>
</span><span style="font-size:10pt;font-family:Arial,sans-serif">We welcome good demonstrations, including of previously accepted papers/demos, for this workshop. Authors need to send manuscript describing the demo in up to 2 pages (2 column format) inclusive of all references and figures. Manuscripts must be written in English, and formatted according to IEEE proceedings templates. Please see the workshop website </span><span style="font-size:12pt;font-family:'Times New Roman',serif"><a href="https://sites.google.com/site/bddc2014/"><span style="font-size:10pt;font-family:Arial,sans-serif">https://sites.google.com/site/bddc2014/</span></a></span><span style="font-size:10pt;font-family:Arial,sans-serif"> for more details.</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br>
</span><span style="font-size:10pt;font-family:Arial,sans-serif">Important dates:</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br></span><span style="font-size:10pt;font-family:Arial,sans-serif">Demo proposals due: July 5, 2014</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br>
</span><span style="font-size:10pt;font-family:Arial,sans-serif">Notification of acceptance: July 15, 2014</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br></span><span style="font-size:10pt;font-family:Arial,sans-serif">Workshop: August 24, 2014</span><span style="font-size:12pt;font-family:'Times New Roman',serif"> <br>
</span><div><span style="font-size:12pt;font-family:'Times New Roman',serif"><br></span></div><div><div style><font face="Times New Roman, serif" size="1">To subscribe to this list, the user sends an email, with blank subject line, to <a href="mailto:listserv@lists.drexel.edu">listserv@lists.drexel.edu</a> . In the text box, the user types: subscribe BIGDATA.</font></div>
<div style><font face="Times New Roman, serif" size="1">To unsubscribe from a list, the user sends an email to <a href="mailto:listserv@lists.drexel.edu">listserv@lists.drexel.edu</a> with the message: signoff BIGDATA.</font></div>
</div></div>