Here is a comprehensive response to your query, drawing on the provided sources:

Data Visualization Guidelines

Data visualization is crucial for Business Intelligence & Analytics (BI&A) because it helps to present complex information in a graphical form (e.g., charts, maps, flow diagrams) to facilitate understanding and pattern recognition [1]. It is key for rapid understanding and communication of insights derived from data analysis [1].

Digital dashboards, for instance, are designed to present key information on a single screen, are typically highly personalized, and display aggregated data (though users can often drill down for more detail) [2, 3]. The primary purpose of visualization is to make complex data accessible and actionable for users, especially for decision-making [1].

Big Data Definition and Characteristics

Big Data refers to data that cannot be handled with traditional approaches, tools, or technologies [4]. It is characterized by its immense scale and scope, often exemplified by the data generated by large digital companies like Google, Amazon, and Facebook [5].

The concept of Big Data is commonly described by The Four V's [5]:

Big Data Technology Categories

When dealing with Big Data, technologies primarily fall into three key categories [8]:

  1. Storage: Addresses the volume component [8].
  2. Processing: Aligns with the velocity component [8].
  3. Analytics: Methods to gain insights from stored and processed information, spanning both volume and variety [8].

Specific Problems That Have to Be Resolved (for Big Data)

Cloud BI – Why Is It Important, Disadvantages

Advantages

Disadvantages: Not explicitly listed in the sources.

Columnar Storage

Columnar storage stores data by columns rather than rows [19]. The primary key maps back to row IDs, optimizing analytical queries.

Main advantages include:

Data Lake vs. Data Warehouse

Data Lakes hold massive raw data in native formats (schema-on-read), designed for low-cost storage and data science exploration [23–27]. They complement, not replace, data warehouses.

Feature Data Warehouse (DWH) Data Lake
Nature of Data Structured, processed Any raw/native format
Schema On-write (predefined) On-read / NoSQL
Data Preparation Up front (ETL) On demand (ELT)
Costs Expensive at scale Low-cost storage
Agility Fixed configuration Highly agile
Users Business professionals Data scientists

One Platform vs. Multiple

Data Fabric (Single)

Data Mesh (Decentralized)

BI&A Information Quality

Focusing on two of Eppler’s dimensions [35]:

1. Consistency

Problem: Fragmented systems yield multiple “truths” (e.g., differing customer addresses) [36–39].

Solution: A DWH provides one source of truth via ETL cleansing; MDM creates a golden record [40–51].

2. Comprehensiveness

Problem: Siloed operational data limits holistic analysis (e.g., marketing vs. sales) [45, 52–55].

Solution: BI&A integrates multiple sources into a unified warehouse, enriching analysis with external data [35, 41, 45, 59, 60].

Encouraging BI System Adoption

DAX Concepts

DAX is used in models (e.g., Power BI) for calculated measures, columns, and tables. Context is key.

Calculated Columns vs. Measures

Filtering Contexts

The Total Is Not Always the Sum!

Iterators (X-functions)

Relational Functions

Filter Manipulation

Identifying Non-Working Code