Lecture Notes in Computer Science 1 Temporal Data Mining an overview

格式：pdf
大小：207.87 KB
文档页数：15

下载文档原格式

/ 15

1、下载文档前请自行甄别文档内容的完整性，平台不提供额外的编辑、内容补充、找答案等附加服务。
2、"仅部分预览"的文档,不可在线预览部分如存在完整性等问题,可反馈申请退款(可完整预览的文档不适用该条件!)。
3、如文档侵犯您的权益，请联系客服反馈,我们会尽快为您处理(人工客服工作时间：9:00-18:30)。

Lecture Notes in Computer Science 1 Temporal Data Mining: an overview

Cláudia M. Antunes 1, and Arlindo L. Oliveira 2

1 Instituto Superior Técnico, Dep. Engenharia Informática, Av. Rovisco Pais 1,

1049-001 Lisboa, Portugal

cman@rnl.ist.utl.pt

2 IST / INESC-ID, Dep. Engenharia Informática, Av. Rovisco Pais 1,

1049-001 Lisboa, Portugal

aml@inesc.pt

Abstract. One of the main unresolved problems that arise during the data

mining process is treating data that contains temporal information. In this case,

a complete understanding of the entire phenomenon requires that the data

should be viewed as a sequence of events. Temporal sequences appear in a vast

range of domains, from engineering, to medicine and finance, and the ability to

model and extract information from them is crucial for the advance of the

information society. This paper provides a survey on the most significant

techniques developed in the past ten years to deal with temporal sequences.

1 Introduction

With the rapid increase of stored data, the interest in the discovery of hidden information has exploded in the last decade. This discovery has mainly been focused on data classification, data clustering and relationship finding. One important problem that arises during the discovery process is treating data with temporal dependencies. The attributes related with the temporal information present in this type of datasets need to be treated differently from other kinds of attributes. However, most of the data mining techniques tend to treat temporal data as an unordered collection of events, ignoring its temporal information.

The aim of this paper is to present an overview of the techniques proposed to date that deal specifically with temporal data mining. Our objective is not only to enumerate the techniques proposed so far, but also to classify and organize them in a way that may be of help for a practitioner looking for solutions to a concrete problem. Other overviews of this area have appeared in the literature ([25], [56]), but is not widely available, while the second one uses a significantly different approach to classify the existing work and is considerably less comprehensive than the present work.

The paper is organized as follows: Section 2 defines the main goals of temporal data mining, related problems and concepts and its applications areas. Section 3 presents the principal approaches to represent temporal sequences, and Section 4 the methods developed to measure the similarity between two sequences. Section 5 addresses the problems of non-supervised sequence classification and relation

Lecture Notes in Computer Science 1 Temporal Data Mining an overview

合集下载

文档推荐

最新文档