A Reference Model for Big Data

January 5, 2018 | Author: Anonymous | Category: Engineering & Technology, Computer Science, Data Mining
Share Embed Donate


Short Description

Download A Reference Model for Big Data...

Description

32N2386

Next Generation Analytics & Big Data (A Reference Model for Big Data) Jangwon Gim Sungjoon Lim Hanmin Jung ISO/IEC JTC1 SC32 Ad-hoc meeting May 29, 2013, Gyeongju Korea

Contents      

Background Brief history of discussions Case study Procedure for developing standardizations for Big Data Reference model for Big Data Conclusions

2

Discussion of Big Data  Data analytics  Data analysis  Baba: Vocabulary, Use-case, and so on  Stabilize Architecture  Define Interfaces  Standardization opportunities

    

Jim: The aspect of Big Data is “There is many different forms” Krishna: Refers to Wikipedia definition Keith Gorden: Volume, Complex, Velocity Keith W. Hare: Open Big Data  Volume, Variety, Velocity, Value, Veracity Any combination is OK.

3

Background  Emerging Technologies For Big Data  In 2012, The hype cycle of Gartner

 Diverse definitions of technologies and services, having different views of data

4

Background  Big Data on hype cycle

 A general and common reference model for Big Data is needed 5

Brief history of discussions Issue Date

Summary

16 November 2011.

[SC32N2181] ISO/IEC JTC 1/SC 32 N2181, “Resolutions and topics from the recent JTC 1 me eting of particular interest to SC 32 participants”, SC32 Chair – Jim Melton

12 January 2012.

[SC32N2198] ISO/IEC JTC 1/SC 32 N 2198, “Analysis of 2012 Gartner Technology Trends”, JTC1 SWG-P - Mario Wendt – Convener  SC 6 Telecommunications and information exchange between systems  SC 32 Data management and interchange  SC 39 Sustainability for and by Information Technology

19 March 2012.

[SC32N2199]ISO/IEC JTC 1/SC 32 N 2199, “Discussion: SC 32 Response to 2011 JTC 1 Resolution 33”, SC32 Chair – Jim Melton

6 June 2012.

[SC32N2241] Ad-hoc on “Next gen analytics” - Keith Hair - Chair

6

The view of Next-Generation Analytics of SC32  Referencing from [SC32N2241]

Architectural Next-Generation Analytics Social Analytics From Baba

Mechanisms Metadata Raw Storage

 Need a reference model for Big Data to enhance interoperability

7

Case Study (1)  Korea Institute of Science and Technology (KISTI)  Dept. of Computer Intelligence Research

8

Case Study (2)  Architecture of InSciTe Adaptive Service

9

Case Study (3)  Semantic Analysis  Text Data to Ontology

10

Case Study (4)  Semantic Analysis  Ontology Schema

11

Case Study (5)  Semantic Analysis  Example of Semantic Analysis

12

Case Study (6)  InSciTe Service Functions – (Hybrid Vehicle) Technology Navigation

Technology Trend

Core Element Technology

Convergence Technology

Agent Level

Agent Partner

Integrated Roadmap

Report

13

Case Study (7)  In 2013, About 10 Billion triples from diverse sites will be extracted

Sites Freebase

The number of Count 1,015,762,951

Yago

224,949,079

DBPedia

449,383,705

DBLP

81,986,947

baseKB

147,549,529

Etc (WhoisWho,NYTimes,LinkedObervedData,…)

2,296,838,760

Total

4,216,470,971

14

Case Study (8)  In 2013, System Architecture of InSciTe Adaptive Service

15

Procedure for developing a reference model for Big Data

1. Eliciting requirements and analyzing the environment of Big Data

2. Establishing visions and strategies for achieving the goal of Big Data We are here

3. Defining a concept model / a reference model / a framework for Big Data

4. Deriving use-cases for applying the Big Data

16

A lifecycle of Big Data

• Collection/Identification • Repository/Registry • Semantic 1. Intellectualization • Integration Data

• Analytics / Prediction

2. • Visualization

Insight

Big Data Action • Data Curation • Data Scientist • Data Engineer

3.

Decision

4.

• Workflow • Data Quality

17

Reference Model for Big Data  A Reference Model for Big Data

Service Layer

Analysis & Prediction

Big Data Management

Interface

Workflow Management

Data Quality Management

Data Visualization

Service Support Layer

Interface

Data Curation

Data Integration Platform Layer

Data Semantic Intellectualization

Security

Interface

Data Identification (Data Mining & Metadata Extraction) Data Collection

Data Registry

Data Layer

Data Repository

18

Reference Model for Big Data  A Reference Model for Big Data

??? 19763 Service Layer

Analysis & Prediction

Big Data Management

Interface

Workflow Management

Data Quality Management

Data Visualization

Service Support Layer

Interface

Data Curation

Data Integration Platform Layer

Data Semantic Intellectualization Interface

9075

Security

13249

Data Identification (Data Mining & Metadata Extraction) Data Collection

Data Registry

Data Layer

Data Repository

11179 19

Conclusions  Summary  Analyzing the circumstance of Big Data  Building a framework for Big Data  Define detail procedure to create the Big Data

 Discussion  Possible suggestions • New Working Group for the reference model of Big Data  New Work Items could be derived from the model • New Study Group

 Future work  Discussion of the concept of NWI • 2013. 11. Interim meetings

 Propose extended the reference model of Big Data (NWI) • 2014. 5 Plenary meeting

20

View more...

Comments

Copyright � 2017 NANOPDF Inc.
SUPPORT NANOPDF