DISA

Centre for Data Intensive Sciences and Applications

DISA Seminar May 9th

2022-05-06

Welcome to our DISA seminar with invited guest Themis Palpanas from The Data Intelligence Institute of Paris (diiP).

  • When? Monday May 9th 12-13
  • Where? Via Zoom – link will be sent to those who sign up
  • Registration: https://forms.gle/nE4nsNTKfy2FJ4pm7

Data Intelligence Institute of Paris: Creating a diiP Connection to DISA and Linnaeus University
The Data Intelligence Institute of Paris (diiP) is an interdisciplinary initiative of Université Paris Cité (France). It is a laboratory that fosters and supports the emergence of interdisciplinary practices around data science and data intelligence. It gathers researchers from formal sciences, physical sciences, life sciences and social sciences. In this talk, we will describe the goals of diiP and its operation, and try to draw parallels with DISA. We hope that this seminar will initiate discussions and eventually collaborations between the two institutes and universities.

Themis Palpanas is Senior Member of the French University Institute (IUF), a distinction that recognizes excellence across all academic disciplines, and professor of computer science at the University of Paris (France), where he is director of the Data Intelligence Institute of Paris (diiP), and director of the data management group, diNo. He received the BS degree from the National Technical
University of Athens, Greece, and the MSc and PhD degrees from the University of Toronto, Canada. He has previously held positions at the University of California at Riverside, University of Trento, and at
IBM T.J. Watson Research Center, and visited Microsoft Research, and the IBM Almaden Research Center.

His interests include problems related to data science (big dataanalytics and machine learning applications). He is the author of 9 US patents (3 of which have been implemented in world-leading commercial data management products), and 2 French patents. He is the recipient of 3 Best Paper awards, and the IBM Shared University Research (SUR) Award. He is currently serving on the VLDB Endowment Board of Trustees, as an Associate Editor in the TKDE, and IDA journals, as well as on the
Editorial Advisory Board of the IS journal, and the Editorial Board of the TLDKS Journal. He has served as Editor in Chief for the BDR Journal (that he drove to an impact factor of 3.578 and cite score of
8.6), as General Chair for VLDB 2013, Associate Editor for VLDB 2022, 2019 and 2017, and Research PC Vice Chair for ICDE 2020.

DISA Seminar April 4th Marie Skłodowska Curie postdoctoral fellowships (MSCA-PF)

2022-03-10

  • When? April 4th, 2022 12-13
  • Where? Online, you will get a link to the event when you have registered
  • Registration: https://forms.gle/M88bznTfa8CYDjjY7

DISA welcomes you to an information seminar focused on information and advice on how to develop a competitive proposal in the upcoming call for Marie Skłodowska Curie postdoctoral fellowships (MSCA-PF). The seminar will include presentations illustrating four relevant perspectives – key information, evaluation of proposals, hosting an MSCA-PF fellow and being an MSCA-PF fellow. As a fellow you apply for the fellowship with a PI, and as a PI you apply for the fellowship with an identified fellow. The seminar will provide researchers interested in becoming a supervisor for a MSCA PF with information on the fellowship programme, the requirements of a supervisor, practical information, and insights on how to write a competitive proposal from the supervisor’s perspective.

The MSCA PF tool is an outstanding prospect to expand your research group by an extremely skilled and funded postdoctoral researcher for two years while adding complementary skills and competencies to your group.

Speakers:

From the programme:

  • What is MSCA Postdoctoral Fellowships?
  • Proposal writing process
  • What is required from you as a Supervisor? The MSCA Guidelines on Supervision.
  • LNU as a host institution
  • Yes, I want to be a MSCA supervisor. What do I do now?
  • Experiences from a LNU MSCA PF Supervisor
  • Q&A

About MSCA PF

The objective of the Horizon Europe MSCA-PFs is to support researchers’ careers and foster excellence in research. The Postdoctoral Fellowship’s action targets researchers holding a PhD who wish to carry out their research activities abroad, acquire new skills and develop their careers. PF grants help researchers gain experience in other countries, disciplines and non-academic sectors. The grant usually covers two years’ salary, a mobility allowance, research costs and overheads for the host institution. Applications should be co-written by the researcher and the host organization. MSCA PF is open to all scientific disciplines.

Visit the European Commission’s website for additional information.

 

KvarkenSat Innovation Challenge 2022 on Sustainable Forestry

2022-02-10

The DISA forestry group invites you to KvarkenSat Innovation Challenge 2022 on Sustainable Forestry that will start in next week with a pre-hackathon followed by the hackathon using space-based data helping to combat climate change!

Acceptance based on submission, the best submissions may be approved early.

The challenges

Climate change brings about major changes affecting us all. Extreme weather events become more frequent and especially the amount of rainfall increases in Northern Europe, one contributor being the warmer winters. New species of both vegetation and animals enter new areas while the existing species might have even major changes in their habitats. These lead to new challenges in the forestry industry. We are looking for ideas and solutions combining existing knowledge and datasets with space-based data and datasets based on satellite measurements, in four particular themes including soil moisture, spruce bark beetles, forest ground damage and the forest value chain.

Who can apply?

The hackathon is open to students, teachers, researchers and start-ups in teams of 3-5 persons. Relevant expertise to participate include: space and satellite data, machine learning and neural networks, computer science, positioning systems, automation, image processing/recognition, engineering, logistics, business/communications and forestry.

Awards

The three best proposals across all of the themes will be awarded a cash prize (over 100 000SEK) and possible continuation/acceleration within start-ups and innovation programs.

Pre-Hack Webinar

To get familiar with the hackathon, meet the mentors and partners, and participate in Q&A-session join our webinar on 15 February at 13.00 (14.00 Finnish time).

Link to the join the webinar: https://bit.ly/KvarkenSatWebinar

More information about the hackathon: https://ultrahack.org/kvarkensat-innovation-challenge-2022

Seminar October 18th – Future Position X

2021-10-01

  • When? Monday October 18th 12-13
  • Where? Online, link will be sent to those who sign up via this link https://forms.gle/NTo7jnysyLkBWaAm8 no later than October 15th

During the seminar Magnus Engström, CTO at Future Position X (FPX) will talk about two clear cases where FPX with data science has contributed to creating the conditions for a viable city center by collecting and combining data from different sources. More specifically, it will be about how we have applied machine learning to be able to predict movements in the city center and how we with a data-driven approach have created an application that helps the University of Gävle to conduct research on how Gävle residents experience their local environment. The presentation will be followed by a Q&A and discussion about potential collaborations with researchers from Linneaus University

Future Position X is an independent Swedish innovation center that works for growth through better health and well-being in the smart, sustainable and vibrant city. FPX contributes both technology and expertise to develop data-driven community solutions.

By initiating projects, creating relationships and building collaborations, FPX contributes to collaboration between business, academia and the public sector. FPX contributes to knowledge development of new technology by creating meeting places and networks around data-driven innovation such as GIS, AI, Internet of Things and blockchain technology. FPX also provides technical solutions, including the Innovation Platform, a data platform that can be used to digitally model societies. We are an important player in the work of strengthening both society and companies to a more sustainable growth.

 

 

Workshop “Critical perspectives on cultural heritage: Re-visiting digitisation” 26 October, 9-12hrs

2021-09-28

Organizers: The workshop is co-organized by Linnaeus University (Centre for Applied Heritage and iInstitute) and Swedish National Heritage Board

Website: https://lnu.se/en/meet-linnaeus-university/current/events/2021/critical-perspectives-on-cultural-heritage-re-visiting-digitisation/

About: Today, the Semantic Web and Linked Open Data are creating new value for the descriptive information in the cultural heritage sector. Libraries, museums, heritage management and archives are seeing new possibilities in sharing by turning their catalogues into open datasets that can be directly accessed, allowing cultural heritage data to be circulated, navigated, analyzed and re-arranged at unprecedented levels. This is supported by research funding bodies, governments and EU policies and numerous political interests, resulting in enormous investment in digitization projects which make cultural heritage information openly available and machine readable. But before deploying this data, one must ask: is this data fit for deployment?

Libraries, museums, heritage management and archives have long histories. Both the collections they house and the language they use(d) to describe said collections are products of that historical legacy, shaped by, amongst others, institutionalized colonialism, racism and patriarchy. Yet descriptive information is now being digitized and shared as if that legacy is not inherent to the collections. Instead, existing units of information are being distributed through new Web 3.0 technologies, bringing with it an outdated knowledge-base. Besides the risk of progressive techniques being applied to regressive content, we may also sacrifice the development of new knowledge in libraries, museums, heritage management and archives aimed at facilitating socially sustainable futures, remediating exploitative historical legacies.

For this workshop, we have invited researchers and practitioners to discuss ways in which digitisation approaches may be set up to change the nature and legacy of cultural collection prior to digital dissemination.

Welcome!

DISA Seminar October 4th on Aggregation as Unsupervised Learning and some of its Applications

2021-09-24

  • When? October 4th 12-13
  • Where? Online – the link will be sent to those who sign up
  • Registration? Sign up via this link no later than October 3rd.

This seminar will be presented by the DISTA research group within DISA, you will meet and listen to Welf Löwe, Maria Ulan, Morgan Ericsson, Anna Wingkvist

Aggregation combines several independent variables to a dependent variable. The independent variables are different, possible mutually dependent observations of a real world. The dependent variable should preserve properties of the independent variables, e.g., the ranking or relative distance of the independent variable tuples, and ultimately the properties of the real world. However, while there usually exist large amounts independent variable tuples, there is no ground truth data available mapping these tuples to the corresponding dependent variable values. This makes aggregation an unsupervised machine learning problem, as opposed to, e.g., regression where data comprises independent variable tuples and the corresponding dependent variable values.

Instances of the problem frequently occur in software engineering, e.g., when trying to assess the quality of software by metrics. Metrics (independent variables) can easily be measured for a lot of software artifacts, but it is hard to measure quality (dependent variable). Instances also occur in many other assessment situations including, but not limited to the assessment of project proposals, financial investments, and human movements.

In our talk, we present
1) aggregation as unsupervised learning including unweighted and weighted approaches
2) ways to evaluate and compare different aggregation approaches including an evaluation of the approaches introduced in 1)
3) applications to software engineering problems applying the evaluation introduce in 2)

The recording of this session and previous recordings will be available at the following link

NEW DISA Seminar Series starting September 6th 12-13

2021-09-02

We are now finally starting a new Seminar series within DISA, even if you are not affiliated with DISA you are welcome to attend.

Aim with the seminar series:
Our research centre now have some 10 different research groups, each comprising a trending research topic. In order to make those different subjects of expertize more known outside of the own group and more accessible to PhD students we now launch a research seminar series.

Out first lunch seminar series will be on Monday September 6th 12-13 with Thomas Holgersson
Link to the seminar: https://lnu-se.zoom.us/j/63536937748 (no sign up needed)

Titel: Matrices in different dimensions: high, low and in between
Abstract: I will survey some common methods for statistical analysis of random matrices in fixed and in increasing dimensions. The geometry of high-dimensional objects will discussed from a data-analytic perspective. I will also cover some different modes of asymptotics, with particular focus on scalability.
Keywords: Wishart ensambles, geometry of high-dimensional objects, spectral analysis, Mahalanobis distance, modes of convergence.

Kind regards,
Thomas and Diana

iInstitute / Digital Humanities webinar: The Ethics of Datafication and AI by Geoffrey Rockwell

2021-05-18

Summary – We all want artificial intelligence to be responsible, trustworthy, and good… the question is how to get beyond principles and check lists. In this paper I will argue for the importance of the data used in training machines, especially when it comes to avoiding bias. Further, I will argue that there is a role for humanists and others who have been concerned with the datafication of the cultural record for some time. Not only have we traditionally been concerned with social, political and ethical issues, but we have developed practices around the curation of the cultural record. We need to ask about the ethics around big data and the creation of training sets. We need to advocate for an ethic of care and repair when it comes to digital archives that can have cascading impact.

About the speaker – Geoffrey Rockwell is a Professor of Philosophy and Digital Humanities, Director of the Kule Institute for Advanced Study and Associate Director of AI for Society signature area at the University of Alberta. He publishes on textual visualization, text analysis, ethics of technology and on digital humanities including a co-authored book Hermeneutica from MIT Press (2016). He is co-developer of Voyant Tools (voyant-tools.org), an award winning suite of text analysis tools. He is currently the President of the Canadian Society for Digital Humanities.