1. What do you mean by collisions in a hash table? How can they be avoided?

A collision in a hash table occurs when two different keys hash to the same index in the array. Since each index is meant to store one entry, a collision must be resolved.

Ways to avoid/handle collisions:
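
One widely used way to handle collisions is separate chaining, where each array slot holds a small list of entries instead of a single one. The sketch below is a minimal illustration, not a production design; the class name `ChainedHashTable` and its methods are hypothetical, and the table is kept at a fixed size for brevity.

```python
class ChainedHashTable:
    """Toy hash table that resolves collisions with separate chaining."""

    def __init__(self, size=4):
        # Each bucket is a list of (key, value) pairs.
        self.buckets = [[] for _ in range(size)]

    def _index(self, key):
        # Map the key's hash onto a bucket index.
        return hash(key) % len(self.buckets)

    def put(self, key, value):
        bucket = self.buckets[self._index(key)]
        for i, (existing_key, _) in enumerate(bucket):
            if existing_key == key:
                bucket[i] = (key, value)  # overwrite an existing key
                return
        bucket.append((key, value))  # new key, possibly sharing the bucket

    def get(self, key, default=None):
        for existing_key, value in self.buckets[self._index(key)]:
            if existing_key == key:
                return value
        return default


table = ChainedHashTable(size=4)
table.put(1, "a")
table.put(5, "b")  # in CPython small ints hash to themselves, so 1 and 5 share bucket 1
print(table.get(1), table.get(5))
```

Both keys land in the same bucket, yet each lookup still returns its own value because the bucket is searched by key. Open addressing (probing for another free slot) is the other common family of strategies.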


2. What are the ways to detect outliers in data?

Outliers can be detected using:
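
Two commonly cited detection rules are the interquartile range (IQR) fence and the z-score. The sketch below uses only Python's standard `statistics` module; the function names, the multiplier `k=1.5`, the threshold values, and the sample ages are illustrative assumptions, not values taken from this document.

```python
import statistics


def iqr_outliers(values, k=1.5):
    """Return values lying more than k * IQR outside the quartiles."""
    q1, _, q3 = statistics.quantiles(values, n=4)
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [v for v in values if v < low or v > high]


def zscore_outliers(values, threshold=3.0):
    """Return values whose absolute z-score exceeds the threshold."""
    mean = statistics.mean(values)
    stdev = statistics.stdev(values)  # sample standard deviation
    if stdev == 0:
        return []  # all values identical, nothing can be an outlier
    return [v for v in values if abs(v - mean) / stdev > threshold]


ages = [18, 19, 19, 20, 20, 21, 21, 22, 45]  # made-up sample
print(iqr_outliers(ages))
print(zscore_outliers(ages, threshold=2.5))
```

On this small sample the IQR rule flags 45, while the z-score of 45 is only about 2.6, so the conventional threshold of 3 would miss it. With few data points a single extreme value inflates the standard deviation, which is one reason the two rules can disagree.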


3. What are different ways to deal with outliers?

Outliers can be handled in several ways depending on the context:
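
Two frequently used options are removing the outlying values and capping them at a boundary (often called winsorizing). The sketch below assumes the lower and upper bounds have already been chosen by some detection rule; the function names, the bounds, and the sample ages are hypothetical.

```python
def remove_outliers(values, low, high):
    """Drop every value outside the closed interval [low, high]."""
    return [v for v in values if low <= v <= high]


def cap_outliers(values, low, high):
    """Clamp every value into [low, high], keeping the row count unchanged."""
    return [min(max(v, low), high) for v in values]


ages = [18, 19, 19, 20, 20, 21, 21, 22, 45]  # made-up sample
low, high = 15.25, 25.25  # illustrative bounds, assumed to be given

removed = remove_outliers(ages, low, high)
capped = cap_outliers(ages, low, high)
print(removed)
print(capped)
```

Removal shrinks the dataset, whereas capping keeps every row but changes the extreme value. Which is appropriate depends on whether the outlier is an error or a genuine observation.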


4. What are some key skills required for a data analyst?

Core skills for a data analyst include:


5. What is the data analysis process?

The typical data analysis process involves:

  1. Define the objective: Understand the business problem.
  2. Data collection: Gather data from various sources.
  3. Data cleaning: Handle missing, duplicate, or incorrect values.
  4. Data exploration: Perform EDA (Exploratory Data Analysis) to understand patterns.
  5. Analysis/modeling: Apply statistical or machine learning models to the data.
  6. Interpretation: Derive insights and patterns.
  7. Visualization & reporting: Present findings using graphs, charts, or dashboards.
  8. Action: Make data-driven decisions or recommendations.

6. What challenges are faced during data analysis?

Data analysts often face the following challenges:


7. What is data cleansing?

Data cleansing (or cleaning) is the process of identifying and correcting errors or inconsistencies in the data to improve its quality.

Key tasks in data cleansing:
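
As a rough illustration of cleansing missing, duplicate, and incorrect values, the sketch below normalizes a few records using only the standard library. The field names `name` and `age`, the 0 to 120 age range, and the sample rows are all assumptions made for the example.

```python
def parse_age(raw):
    """Return a plausible integer age, or None if missing or invalid."""
    try:
        age = int(str(raw).strip())
    except ValueError:
        return None  # empty or non-numeric input is treated as missing
    return age if 0 <= age <= 120 else None  # reject impossible values


def clean_records(records):
    """Normalize names, fix ages, and drop exact duplicates."""
    cleaned, seen = [], set()
    for rec in records:
        name = str(rec.get("name") or "").strip().title()
        if not name:
            continue  # skip rows with no usable name
        age = parse_age(rec.get("age"))
        key = (name, age)
        if key in seen:
            continue  # duplicate after normalization
        seen.add(key)
        cleaned.append({"name": name, "age": age})
    return cleaned


raw = [
    {"name": " alice ", "age": "21"},
    {"name": "Alice", "age": "21"},   # duplicate once whitespace and case are fixed
    {"name": "Bob", "age": ""},       # missing age
    {"name": "Cara", "age": "-3"},    # impossible age
]
result = clean_records(raw)
print(result)
```

Here invalid ages are replaced by `None` rather than guessed. Whether to drop, flag, or impute such values is a decision that depends on the analysis.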


8. What are the tools useful for data analysis?

Popular tools used by data analysts include:


9. What is the difference between data mining and data profiling?

Aspect          | Data Mining                                  | Data Profiling
Purpose         | Discover hidden patterns and insights        | Understand structure, quality, and content
Focus           | Predictive analysis, pattern recognition     | Data assessment and summary statistics
Techniques Used | ML algorithms, clustering, association rules | Metadata analysis, frequency/distribution check
Output          | Actionable insights and trends               | Data quality reports

10. Which validation methods are employed by data analysts?

Common data validation methods include:
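
Typical checks include type, range, format, and uniqueness validation. The sketch below shows one possible way to apply them row by row; the field names, the age range, and the deliberately loose e-mail pattern are hypothetical choices for illustration.

```python
import re

# Loose illustrative pattern: something@something.something, no spaces.
EMAIL_PATTERN = re.compile(r"^[^@\s]+@[^@\s]+\.[^@\s]+$")


def validate_row(row, seen_ids):
    """Return a list of validation error messages for one row."""
    errors = []

    # Type and uniqueness check on the identifier.
    if not isinstance(row.get("id"), int):
        errors.append("id: expected an integer")
    elif row["id"] in seen_ids:
        errors.append("id: duplicate value")
    else:
        seen_ids.add(row["id"])

    # Type and range check on age.
    age = row.get("age")
    if not isinstance(age, int) or not 0 <= age <= 120:
        errors.append("age: out of range or not an integer")

    # Format check on e-mail.
    if not EMAIL_PATTERN.match(str(row.get("email", ""))):
        errors.append("email: invalid format")

    return errors


seen = set()
good = validate_row({"id": 1, "age": 30, "email": "a@example.com"}, seen)
bad = validate_row({"id": 1, "age": 200, "email": "nope"}, seen)
print(good)
print(bad)
```

The first row passes every check, while the second fails all three because it reuses the id, exceeds the age range, and has a malformed address.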


11. What is an outlier?

An outlier is a data point that differs significantly from the other observations in a dataset. It may reflect genuine variability in the data, or it may result from a measurement or data-entry error.

Example: In a dataset of student ages where most are between 18–22, a value of 45 could be an outlier.

Why outliers matter:
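
One concrete reason is that a single extreme value can shift summary statistics such as the mean while leaving the median almost untouched. The sketch below uses made-up ages consistent with the 18–22 example above plus one value of 45; the individual numbers are assumptions.

```python
import statistics

ages = [18, 19, 19, 20, 20, 21, 21, 22]  # made-up sample, all within 18 to 22
with_outlier = ages + [45]               # same sample plus one extreme value

mean_before = statistics.mean(ages)
mean_after = statistics.mean(with_outlier)
median_before = statistics.median(ages)
median_after = statistics.median(with_outlier)

print(f"mean:   {mean_before:.2f} -> {mean_after:.2f}")
print(f"median: {median_before:.2f} -> {median_after:.2f}")
```

The mean rises from 20 to roughly 22.8, above every typical value in the sample, while the median stays at 20.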


Retrieve customers who made purchases on two consecutive days

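-- Pair each order with the previous order date for the same id.
-- Assumes id identifies the customer; DATEDIFF(DAY, ...) is SQL Server syntax.
WITH cte AS (
    SELECT
        id,
        order_date,
        LAG(order_date) OVER (
            PARTITION BY id
            ORDER BY order_date
        ) AS prev_order_date
    FROM Orders
)
-- Keep only orders placed exactly one calendar day after the previous one.
SELECT
    id,
    order_date,
    prev_order_date
FROM cte
WHERE DATEDIFF(DAY, prev_order_date, order_date) = 1;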
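
To sanity-check the query's logic outside a database, the same idea (compare each order date with the previous one for the same id) can be sketched in Python. The function name, the tuple layout, and the sample orders are hypothetical, and the sketch assumes `id` identifies the customer.

```python
from collections import defaultdict
from datetime import date


def consecutive_day_purchases(orders):
    """orders: iterable of (customer_id, order_date) pairs."""
    by_customer = defaultdict(list)
    for customer_id, order_date in orders:
        by_customer[customer_id].append(order_date)

    result = []
    for customer_id, dates in by_customer.items():
        dates.sort()
        # Pair each date with its predecessor, mirroring LAG(order_date).
        for prev, curr in zip(dates, dates[1:]):
            if (curr - prev).days == 1:
                result.append((customer_id, curr, prev))
    return result


orders = [
    (1, date(2024, 1, 1)),
    (1, date(2024, 1, 2)),  # one day after the previous order
    (2, date(2024, 1, 1)),
    (2, date(2024, 1, 5)),  # four days later, not consecutive
]
matches = consecutive_day_purchases(orders)
print(matches)
```

Only customer 1 is returned, since customer 2's orders are four days apart.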
