Written by students who passed Immediately available after payment Read online or as PDF Wrong document? Swap it for free 4.6 TrustPilot
logo-home
Class notes

Class notes SQL

Rating
-
Sold
-
Pages
25
Uploaded on
19-06-2021
Written in
2020/2021

This document contains all the notes for the information management subject. Most importantly introduction to UML, XML, UMl to XML conversion, DTD, XQUERY

Institution
Course

Content preview

Introduction to Information Management

## Some Core Concepts

**Organisation:** How data is represented/associated.

**Metadata:** Data about what the data is.

**Access:** How to interact with the data efficiently.



## What is the Difference Between Data, Information and Knowledge

**Data:**

- Raw; Building blocks of information.

- Unprocessed information.



**Information:**

- Data associated together to convey some meaning.

- Basic unit of communication.



**Knowledge:**

- Interrelating and "understanding" information.



## Maintaining Structure in Your Own Data File

Files just represent data as a series of bytes and will *lose the structure* that you might have
imposed either logically or physically unless you do something about it.



There are many ways of adding structure to files, for example:

- **Delimited-text Field:** Choose a special character/delimiter that will not appears as a legitimate
character within the information field and then insert that character into the file after writing each
field.

- **Fixed-length Field:** Use a fixed length for each information field and pad out when the length
of the actual data value is less than the fixed length.

- **Length-based Field:** Write the length of the value of the information field followed by the
value in exactly that number of bytes.

- **Identified Field:** Write the name of the information field and then value both represented as
delimited-text fields.

,## Turning Data into information

There are two distinct approaches:

- **Structured Approach:** Deliberately associate data together to turn it into information. *E.g.
excel, databases, etc.*

- **Unstructured Approach:** Bring loosely managed data together to serve specific information
needs using information retrieval techniques. *E.g. search engines.*



## Nature of Querying

**Structured:** Uses artificial language with known data types and exact criteria.

**Unstructured:** Keyword and phrase based.



## Nature of Results

**Structured:**

- Returns definitive results.

- Returns the complete set of data that meets search criteria.

- No estimation of relevancy



## Structured Approach Specialist Software

### Databases (DBs)

- A combination of software and hardware.

- Optimised to reduce data to storage transfer.

- Optimised to provide Transactional/ACID (Atomic, Consistent, Isolated, Durable).

- Designed to be administered and secure.

- Different Models:

- Relational.

- Networked.

- Hierarchical.

- Object-oriented



### DataWarehouses (DWs)

, DataWarehouse is a subject oriented, integrated, nonvolatile, time-variant collection of data in
support of management's decisions.



DataWarehouse is a repository of data which is:

- Separate from operational systems and populated by data from these systems.

- Provides a trend view of data.

- Available entirely for the task of making data available to be interrogated by business users.

- **Timestamped** and associated with defined periods of time.

- Accessible to users who have a limited knowledge of computer systems or data structures.



Used for:

- Data mining.

- Decision Support.

- OLAP.



## Unstructured Approach Specialist Software

### Information Retrieval

Fundamental concerns:

- Efficient Access.

- "Relevant" results through matching



## Common Challenges managing data for Enterprises and Individuals

**Variety:** Data extends beyond structured data, including semi-structured and unstructured data
of all varieties.

**Volume:** There is an insane amount of data in the world.

**Velocity:** Often time-sensitive, data must be processed as it is streaming in order to maximise
its value.

**Validity:**

- **Data Protection:** Consent and compliance.

- **Data Privacy:** What data an individual is willing to share.

- **Data Ethics:** Consideration of ethical issues when processing data.

Written for

Institution
Course

Document information

Uploaded on
June 19, 2021
Number of pages
25
Written in
2020/2021
Type
Class notes
Professor(s)
Na
Contains
All classes

Subjects

$25.12
Get access to the full document:

Wrong document? Swap it for free Within 14 days of purchase and before downloading, you can choose a different document. You can simply spend the amount again.
Written by students who passed
Immediately available after payment
Read online or as PDF

Get to know the seller
Seller avatar
jitishb

Get to know the seller

Seller avatar
jitishb Trinity College Dublin
Follow You need to be logged in order to follow users or courses
Sold
1
Member since
4 year
Number of followers
1
Documents
2
Last sold
4 year ago

0.0

0 reviews

5
0
4
0
3
0
2
0
1
0

Why students choose Stuvia

Created by fellow students, verified by reviews

Quality you can trust: written by students who passed their tests and reviewed by others who've used these notes.

Didn't get what you expected? Choose another document

No worries! You can instantly pick a different document that better fits what you're looking for.

Pay as you like, start learning right away

No subscription, no commitments. Pay the way you're used to via credit card and download your PDF document instantly.

Student with book image

“Bought, downloaded, and aced it. It really can be that simple.”

Alisha Student

Working on your references?

Create accurate citations in APA, MLA and Harvard with our free citation generator.

Working on your references?

Frequently asked questions