Welcome, dear student! In this unit, we will learn about Geographical Enquiry and Map Making. These are two of the most important skills a geography student can develop. Have you ever wondered how geographers find out about the world? How do they collect information, analyze it, and present it on a map? That is exactly what this unit is all about. Let us begin step by step.
8.1 Fundamentals of Research in Geography
8.1.1 What is Geographical Enquiry?
Geographical enquiry is the process of asking questions about the Earth and its features, and then systematically investigating those questions to find answers. Geographers ask questions like: What is it? Where is it? Why is it there? How does it affect people? These questions help us understand the world around us.
A geographical enquiry follows a structured approach, much like a scientific investigation. It is not just about collecting facts — it is about understanding the relationships between people, places, and environments.
Can you think of a geographical question about your own town or city? For example: “Why are more shops located near the main road than in the residential areas?” That is a geographical question!
8.1.2 Steps in Geographical Research
Geographical research follows several clear steps. Let us go through each one carefully.
Step 1: Identifying the Problem (Topic Selection)
The first step is to decide what you want to study. This is called identifying the research problem or selecting a topic. The topic should be clear, specific, and related to geography. For example, instead of saying “I want to study rivers,” a better topic would be: “The causes of flooding in the Awash River Basin.”
A good research problem should be:
- Clear: Easy to understand
- Specific: Not too broad
- Researchable: Possible to collect data on it
- Relevant: Important and useful
Practice Question: Which of these is a better research topic? (a) “Study of climate” or (b) “The impact of rainfall variability on crop production in Amhara Region.” Why?
Step 2: Formulating a Hypothesis
A hypothesis is a tentative answer to your research question. It is an educated guess that you will test through your research. A hypothesis should be stated clearly and in a way that can be tested.
Research Question: “Does deforestation increase soil erosion in the Ethiopian highlands?”
Hypothesis: “Deforestation increases the rate of soil erosion in the Ethiopian highlands.”
A hypothesis can be written in different ways. The most common forms are:
- Directional hypothesis: States the expected direction of the relationship. Example: “Increased rainfall leads to higher crop yields.”
- Non-directional hypothesis: States there is a relationship but does not specify the direction. Example: “There is a relationship between rainfall and crop yields.”
- Null hypothesis (H₀): States there is NO significant relationship. Example: “There is no significant relationship between deforestation and soil erosion.”
- Alternative hypothesis (H₁): States there IS a significant relationship. Example: “There is a significant relationship between deforestation and soil erosion.”
The null hypothesis is very important because in research, we test the null hypothesis. If we reject it, we accept the alternative hypothesis.
• A hypothesis must be testable and falsifiable.
• It should be concise and clear.
• It predicts the relationship between variables.
• The null hypothesis (H₀) says “no effect” or “no relationship.”
• The alternative hypothesis (H₁) says “there is an effect” or “there is a relationship.”
Question 1: A student writes: “Soil erosion is bad.” Is this a good hypothesis? Explain.
Answer: No, this is not a good hypothesis. A hypothesis must be testable and specific. “Soil erosion is bad” is a value judgment, not a testable statement. A better hypothesis would be: “Areas with steeper slopes experience higher rates of soil erosion than areas with gentle slopes.” This can be tested by measuring slope angles and erosion rates in different areas.
Question 2: Write a null hypothesis and an alternative hypothesis for the research question: “Does temperature affect the rate of evaporation from Lake Ziway?”
Answer:
H₀ (Null): There is no significant relationship between temperature and the rate of evaporation from Lake Ziway.
H₁ (Alternative): There is a significant relationship between temperature and the rate of evaporation from Lake Ziway.
Question 3: Why is the null hypothesis important in geographical research?
Answer: The null hypothesis is important because it provides a baseline that can be tested using statistical methods. Researchers collect data and analyze it to determine whether to reject or fail to reject the null hypothesis. This approach ensures that conclusions are based on evidence, not opinion. It also helps avoid bias in research.
Step 3: Review of Related Literature
Before collecting your own data, you must review what other researchers have already written about your topic. This is called a literature review. It helps you:
- Understand what is already known about the topic
- Avoid repeating work that has already been done
- Identify gaps in knowledge that your research can fill
- Learn about the methods other researchers used
Sources for literature review include: textbooks, journal articles, government reports, previous theses, and reliable internet sources.
Think about it: If you are studying soil erosion in your area, why would it be helpful to read what previous students or scientists have written about soil erosion in similar areas?
Step 4: Research Design and Methodology
This step involves planning how you will conduct your research. You need to decide:
- Research approach: Will you use a quantitative approach (numbers, measurements, statistics) or a qualitative approach (words, descriptions, observations)? Or both?
- Data collection methods: How will you gather information? (We will discuss this in detail below.)
- Sampling technique: Who or what will you study? How will you select your sample?
- Data analysis methods: How will you analyze the data you collect?
Step 5: Data Collection
Data collection is the process of gathering information to answer your research question. This is a very important step. Let us look at the types of data and methods of collection in detail.
8.1.3 Types of Data
Data in geographical research can be classified in several ways:
A. Primary Data vs. Secondary Data
| Primary Data | Secondary Data |
|---|---|
| Collected firsthand by the researcher | Already collected by someone else |
| Original and specific to your study | May be general, not specific to your study |
| More reliable for your specific purpose | Less expensive and less time-consuming |
| Examples: surveys, interviews, field observations, measurements | Examples: census data, government reports, published maps, textbooks |
B. Quantitative Data vs. Qualitative Data
| Quantitative Data | Qualitative Data |
|---|---|
| Expressed in numbers | Expressed in words |
| Can be measured and counted | Describes qualities and characteristics |
| Analyzed using statistics | Analyzed using themes and patterns |
| Examples: temperature, rainfall amount, population number, distance | Examples: people’s opinions, land use descriptions, soil color, vegetation type |
• Primary data = you collect it yourself (fieldwork).
• Secondary data = someone else collected it.
• Quantitative = numbers (measurable).
• Qualitative = words/descriptions (observable qualities).
• Most geographical research uses BOTH primary and secondary data.
8.1.4 Methods of Data Collection
There are several methods of collecting primary data in geography:
1. Field Observation
This is the most basic method. You go to the study area and observe directly what is happening. For example, you might observe the types of crops grown, the condition of roads, or the pattern of settlements. Observations can be:
- Structured observation: You have a checklist of specific things to look for.
- Unstructured observation: You observe freely without a fixed checklist.
- Participant observation: You participate in the activity while observing (for example, living in a village to understand farming practices).
2. Questionnaire (Survey)
A questionnaire is a set of written questions given to people to answer. It is useful for collecting data from many people quickly. Questionnaires can contain:
- Closed-ended questions: Respondents choose from given options. Example: “What is your main source of income? (a) Farming (b) Trading (c) Employment (d) Other”
- Open-ended questions: Respondents write their own answers. Example: “What challenges do you face in farming?”
- Likert scale questions: Respondents rate their agreement. Example: “Soil erosion is a serious problem here: [Strongly Disagree / Disagree / Neutral / Agree / Strongly Agree]”
3. Interview
An interview involves asking questions directly to people face to face. Interviews can be:
- Structured interview: Fixed questions, same for every respondent.
- Unstructured interview: Flexible, like a conversation.
- Semi-structured interview: Some fixed questions, but allows follow-up questions.
4. Measurement and Instrumentation
This involves using instruments to measure physical quantities. Examples:
- Using a thermometer to measure temperature
- Using a rain gauge to measure rainfall
- Using a tape measure to measure distance or slope
- Using a GPS to record locations
5. Mapping and Sketching
Drawing maps and sketches of the study area is an important data collection method in geography. You can sketch land use patterns, settlement patterns, drainage systems, and more.
Practice: Imagine you want to study “The causes of deforestation around your town.” Which data collection methods would you use? List at least three and explain why.
• Field observation = direct watching and recording.
• Questionnaire = written questions for many respondents.
• Interview = face-to-face questioning (deeper information).
• Measurement = using instruments for physical data.
• Mapping = drawing sketches and maps of features.
• Always use MORE THAN ONE method for reliable results (this is called triangulation).
Question 1: A student wants to study the impact of industrial pollution on the water quality of a river near their city. Identify three appropriate methods of primary data collection and explain what each would be used for.
Answer:
(1) Measurement: Use water testing kits or instruments to measure pH, dissolved oxygen, turbidity, and chemical content of the river water at different points. This provides quantitative data on water quality.
(2) Field Observation: Visit the river at different locations to observe the color of the water, presence of algae or dead fish, industrial discharge points, and the general condition of the riverbanks. This provides qualitative descriptive data.
(3) Interview: Interview people living near the river (especially farmers who use the water) about changes they have noticed in water quality over time, health problems, and effects on their activities. This provides people’s perceptions and historical information.
Question 2: Differentiate between closed-ended and open-ended questionnaire questions. Give one example of each related to geographical enquiry.
Answer:
Closed-ended questions provide fixed response options. The respondent selects from the given choices. Example: “Which type of soil is found in your farm? (a) Clay (b) Sandy (c) Loam (d) I don’t know”
Open-ended questions allow respondents to answer in their own words without fixed options. Example: “What problems does the soil type in your farm cause for crop production?”
Closed-ended questions are easier to analyze statistically but limit responses. Open-ended questions allow detailed answers but are harder to analyze.
Question 3: What is triangulation in data collection, and why is it important?
Answer: Triangulation means using multiple data collection methods or multiple data sources to study the same research problem. For example, using both a questionnaire AND field observation AND interviews to study deforestation. It is important because: (1) It increases the reliability and validity of the research findings, (2) It allows cross-checking of information from different sources, (3) It reduces bias that might come from using only one method, (4) It provides a more complete and comprehensive understanding of the research problem.
8.1.5 Sampling Techniques
In most geographical research, it is impossible to study every single person, place, or feature. Instead, we study a sample — a smaller group that represents the larger group (called the population).
Key Definitions:
- Population: The entire group you want to study. Example: All farmers in Oromia Region.
- Sample: A subset of the population that you actually study. Example: 200 farmers selected from Oromia Region.
- Sampling frame: The list from which you select your sample. Example: A list of all registered farmers in selected districts.
- Sample size: The number of units in your sample.
Types of Sampling:
A. Probability Sampling — Every member of the population has a known, non-zero chance of being selected.
1. Simple Random Sampling: Every member has an equal chance of being selected. You can use a random number table or draw lots.
2. Systematic Sampling: You select every k-th member from a list after a random start.
3. Stratified Sampling: You divide the population into subgroups (strata) based on a characteristic, then randomly sample from each stratum proportionally.
4. Cluster Sampling: You divide the population into clusters (naturally occurring groups), randomly select some clusters, and study ALL members in those clusters.
B. Non-Probability Sampling — Not every member has a known chance of selection.
- Convenience sampling: Select whoever is easily available. (Not recommended for serious research.)
- Purposive (judgmental) sampling: Researcher selects members based on their knowledge or relevance.
- Snowball sampling: Existing participants refer other participants. Useful for hard-to-reach populations.
- Quota sampling: Like stratified sampling but selection within strata is non-random.
Sample Size Determination
A common formula for determining sample size for a large population is:
Where:
- $n$ = required sample size
- $N$ = population size
- $e$ = margin of error (level of precision), commonly $0.05$ (5%)
Worked Example: A researcher wants to study soil conservation practices among 2,000 farmers in a district. Using a 5% margin of error, what sample size is needed?
So the researcher needs a sample of approximately 333 farmers.
Try this: If the population is 500 farmers and the margin of error is 0.05, what is the required sample size? (Try calculating before looking at the answer!)
• Probability sampling = random selection, more representative, more scientific.
• Simple random = every member has equal chance.
• Systematic = every k-th member after random start; k = N/n.
• Stratified = divide into strata, then sample proportionally.
• Cluster = select entire clusters randomly.
• Sample size formula: n = N / (1 + Ne²)
• Always specify your sampling method and justify your choice in research.
Question 1: A researcher wants to survey 800 households in a town. Using a 5% margin of error, calculate the required sample size using the formula n = N/(1+Ne²).
Answer:
N = 800, e = 0.05
n = 800 / (1 + 800 × 0.05²)
n = 800 / (1 + 800 × 0.0025)
n = 800 / (1 + 2)
n = 800 / 3
n = 266.67 ≈ 267 households
Question 2: Explain the difference between stratified sampling and cluster sampling with a geographical example.
Answer:
Stratified sampling: The population is divided into homogeneous subgroups (strata) based on a specific characteristic, and then a random sample is taken FROM EACH stratum. Example: To study land use in a district, divide farmers into strata based on farm size (small, medium, large), then randomly sample farmers from each size category. Every stratum is represented in the sample.
Cluster sampling: The population is divided into naturally occurring groups (clusters), then ENTIRE clusters are randomly selected and ALL members in those clusters are studied. Example: To study land use, divide the district into kebeles (clusters), randomly select 5 kebeles, and survey ALL farmers in those 5 kebeles. Only selected clusters are studied.
Key difference: In stratified sampling, a few members are taken from EVERY stratum. In cluster sampling, ALL members are taken from SELECTED clusters.
Question 3: Why is simple random sampling considered the most unbiased sampling technique?
Answer: Simple random sampling is considered the most unbiased because: (1) Every member of the population has exactly the same probability of being selected, (2) There is no human judgment involved in the selection process — it relies purely on chance, (3) It minimizes selection bias because the researcher cannot favor any particular member, (4) It satisfies the statistical assumption of equal probability, which is important for many statistical tests. However, it requires a complete sampling frame (list of all members), which may not always be available.
8.1.6 Data Analysis and Presentation
After collecting data, the next step is to analyze and present it so that it can be understood easily. This is where your data becomes meaningful.
For Quantitative Data:
- Descriptive statistics: Mean, median, mode, range, standard deviation
- Measures of central tendency:
Worked Example: Calculating Mean and Range
A student recorded the following daily rainfall amounts (in mm) at a weather station for one week: 12, 15, 0, 8, 22, 18, 5
For Qualitative Data:
- Organize into themes, categories, or patterns
- Use quotations from respondents to support points
- Describe and explain findings in words
Methods of Data Presentation in Geography:
- Tables: Organize data in rows and columns for easy comparison
- Bar graphs: Compare quantities across categories
- Line graphs: Show trends over time
- Pie charts: Show proportions of a whole
- Scatter plots: Show relationship between two variables
- Maps: Show spatial distribution of data
- Flow diagrams: Show processes or movement
- Photographs and sketches: Provide visual evidence
Think about it: If you collected data on monthly temperature for one year, which type of graph would be most appropriate? Why?
• Mean = sum of all values ÷ number of values.
• Median = middle value when data is arranged in order.
• Mode = most frequently occurring value.
• Range = highest − lowest value.
• Standard deviation measures spread/dispersion of data.
• Line graph = trends over time. Bar graph = comparison. Pie chart = proportions. Map = spatial distribution.
• Always give every table, graph, and map a clear title and labels.
Question 1: The following data shows the population (in thousands) of five towns in Ethiopia: Town A = 120, Town B = 85, Town C = 210, Town D = 65, Town E = 150. Calculate the mean population and the range.
Answer:
Mean = (120 + 85 + 210 + 65 + 150) / 5 = 630 / 5 = 126 thousand
Range = 210 − 65 = 145 thousand
Question 2: A student collected monthly rainfall data for 12 months. Which type of graph should they use to present this data? Justify your answer.
Answer: A line graph is the most appropriate. Reason: Monthly rainfall data is time-series data (data collected over successive time periods). A line graph clearly shows the trend of rainfall over the 12 months — when it increases, decreases, and peaks. It allows the reader to see seasonal patterns at a glance. A bar graph could also work but would not show the continuous trend as effectively. A pie chart would NOT be appropriate because it shows proportions of a whole, not changes over time.
Question 3: Explain the difference between the mean and the median. When might the median be more useful than the mean?
Answer:
Mean is the arithmetic average — the sum of all values divided by the number of values. It uses every data point.
Median is the middle value when data is arranged in ascending or descending order.
The median is more useful than the mean when the data contains extreme values (outliers). For example, if household incomes in a village are: 5,000, 6,000, 5,500, 7,000, and 200,000 birr (one very wealthy family), the mean would be 44,700 birr, which does not represent most households. The median would be 6,000 birr, which is a much better representation of the typical household income. In geography, the median is preferred for data with extreme values like income, land size, or population of settlements.
8.1.7 Drawing Conclusions and Writing the Report
After analyzing your data, you need to:
- Interpret the results: What do the findings mean? Do they answer your research question?
- Accept or reject the hypothesis: Based on the evidence, is your hypothesis supported?
- Draw conclusions: State clearly what you found out.
- Make recommendations: Suggest actions based on your findings.
- Write the research report: Present your entire study in a structured format.
Structure of a Geographical Research Report:
- Title page
- Acknowledgment
- Table of contents
- Introduction (background, problem statement, objectives, hypothesis)
- Literature review
- Study area description
- Methodology (research design, data collection, sampling)
- Data presentation and analysis
- Discussion of findings
- Conclusions and recommendations
- References
- Appendices (questionnaires, extra maps, photos)
8.2 GIS Data and Map Making Using GIS
8.2.1 What is GIS?
GIS stands for Geographic Information System. It is a computer-based system for storing, managing, analyzing, and displaying geographic data. In simple words, GIS helps us put information on a map and understand patterns and relationships in that information.
A GIS is not just a map — it is a system that links data to locations on the Earth’s surface. For example, a GIS can show not only where hospitals are located, but also how many patients each hospital serves, what diseases are common in each area, and how far people travel to reach the nearest hospital.
Have you ever used Google Maps or a navigation app on your phone? That is a simple form of GIS! It shows your location, routes, and places of interest — all linked to geographic coordinates.
• GIS = Geographic Information System.
• It is a computer-based tool for capturing, storing, analyzing, and displaying spatial data.
• GIS links DATA to LOCATION on the Earth’s surface.
• It helps us see patterns, relationships, and trends that are not visible in tables or text.
• GIS has five components: hardware, software, data, people, and methods.
8.2.2 Components of GIS
A complete GIS has five main components:
1. Hardware: The physical equipment used to run GIS. This includes:
- Computer (desktop, laptop, or server)
- GPS receiver (for collecting location data in the field)
- Digitizer (for converting paper maps to digital format)
- Printer/plotter (for printing maps)
- Storage devices (hard drives, external drives)
2. Software: The programs that run on the hardware. GIS software includes:
- ArcGIS (by Esri) — the most widely used commercial GIS software
- QGIS — free and open-source GIS software
- Google Earth — for viewing satellite imagery
- Database management software (for storing data)
3. Data: The most important component! GIS is useless without data. GIS data includes:
- Spatial data (where things are — coordinates, boundaries)
- Attribute data (what things are — population, temperature, land use type)
4. People: GIS requires trained people who can:
- Collect and enter data
- Manage the GIS system
- Analyze data and create maps
- Interpret results and make decisions
5. Methods: The rules, procedures, and models used to operate the GIS. This includes:
- Data standards (how data should be formatted)
- Analysis procedures (step-by-step methods)
- Quality control measures
Question 1: List and briefly explain the five components of a Geographic Information System.
Answer:
(1) Hardware: The physical equipment — computers, GPS receivers, digitizers, printers, and storage devices needed to run GIS.
(2) Software: The computer programs such as ArcGIS, QGIS, or Google Earth that process, analyze, and display geographic data.
(3) Data: The geographic and attribute information that is fed into the system. Without data, GIS cannot function. This includes both spatial data (locations) and non-spatial data (characteristics).
(4) People: The human users who operate the GIS — data collectors, GIS analysts, map makers, and decision makers who interpret the output.
(5) Methods: The procedures, rules, and analytical models that guide how the GIS is used — including data standards, analysis workflows, and quality control procedures.
Question 2: Why is data considered the most important component of GIS?
Answer: Data is the most important component because: (1) Without data, the hardware and software have nothing to process or display — the system is empty, (2) The quality of the output (maps, analysis results) depends entirely on the quality of the input data — “garbage in, garbage out,” (3) GIS is specifically designed to handle geographic data, so data is the reason the system exists, (4) Even the most expensive hardware and most advanced software cannot produce useful results without accurate, relevant data. All other components exist to serve the data.
8.2.3 Types of GIS Data
GIS data is classified into two main types based on how it represents features:
A. Vector Data
Vector data represents features using points, lines, and polygons:
- Points: Represent features that have a location but no significant size. Examples: a school, a hospital, a borehole, a weather station.
- Lines: Represent features that have length but no significant width. Examples: a road, a river, a railway line, a boundary between two regions.
- Polygons: Represent features that have both area and boundary. Examples: a lake, a forest, a district boundary, a farmland, a building footprint.
B. Raster Data
Raster data represents the Earth’s surface as a grid of cells (pixels), where each cell has a value. Raster data is used for continuous surfaces.
- Each cell = one pixel with a value
- Examples: satellite images, aerial photographs, digital elevation models (DEM), temperature surfaces, rainfall maps
- Higher resolution = smaller cell size = more detail
| Feature | Vector Data | Raster Data |
|---|---|---|
| Representation | Points, lines, polygons | Grid of cells (pixels) |
| Best for | Discrete features (roads, buildings, boundaries) | Continuous surfaces (temperature, elevation, satellite images) |
| File size | Usually smaller | Usually larger |
| Resolution | Scale-independent (can zoom without losing quality) | Resolution-dependent (pixelation at high zoom) |
| Examples | Roads, rivers, land parcels, city points | Satellite imagery, DEM, rainfall grid |
Attribute Data (Non-Spatial Data)
In addition to spatial data (location), every GIS feature has attribute data — the descriptive information about the feature stored in a table. This is stored in an attribute table.
• Vector = points, lines, polygons (discrete features).
• Raster = grid of cells/pixels (continuous surfaces).
• Attribute data = descriptive information linked to spatial features.
• Attribute table links spatial data to non-spatial data using a common ID.
• Vector is better for boundaries, roads, buildings.
• Raster is better for satellite images, elevation, temperature surfaces.
Question 1: For each of the following geographical features, state whether it would best be represented as a point, a line, or a polygon in vector GIS data: (a) A borehole, (b) The Awash River, (c) Abijatta-Shalla Lakes National Park, (d) A highway between Addis Ababa and Adama, (e) A health center, (f) A coffee plantation.
Answer:
(a) Borehole → Point (it is a specific location with no significant area)
(b) Awash River → Line (it has length but in GIS at a small scale, its width is not significant)
(c) Abijatta-Shalla Lakes National Park → Polygon (it covers a defined area with boundaries)
(d) Highway between Addis Ababa and Adama → Line (it has length but negligible width on a map)
(e) Health center → Point (specific location)
(f) Coffee plantation → Polygon (it covers an area with boundaries)
Question 2: Explain the difference between spatial data and attribute data in GIS. Give an example of each for a river feature.
Answer:
Spatial data describes WHERE a feature is located on the Earth’s surface. It includes coordinates (latitude and longitude), shape, and geometry. For a river: the spatial data would be the series of coordinate points that trace the path of the river on a map (its line geometry).
Attribute data describes WHAT the feature is — its characteristics and properties. It is non-spatial and is stored in an attribute table. For a river: the attribute data would include the river’s name (Awash), length (1,200 km), discharge rate, water quality, and whether it is perennial or seasonal.
In GIS, spatial data and attribute data are linked together through a unique identifier (ID) in the attribute table.
Question 3: A student wants to create a map showing temperature distribution across Ethiopia. Should they use vector or raster data? Justify your answer.
Answer: They should use raster data. Reason: Temperature is a continuous variable — it exists at every point on the Earth’s surface and changes gradually across space. Raster data represents continuous surfaces as a grid of cells, where each cell holds a temperature value. This is ideal for showing how temperature varies smoothly across the country. Vector data (points, lines, polygons) is better for discrete features with defined boundaries, which does not suit temperature distribution well. However, they might use point data (weather stations) as the INPUT and then interpolate to create a raster surface.
8.2.4 Geographic Coordinate Systems and Map Projections
To put data on a map, we need a system for defining locations on the Earth’s surface. There are two main systems:
A. Geographic Coordinate System (Latitude and Longitude)
The Earth’s surface is divided using a grid of lines:
- Latitude: Lines running east-west, measuring distance north or south of the Equator (0° to 90° N or S). The Equator is 0° latitude.
- Longitude: Lines running north-south, measuring distance east or west of the Prime Meridian (0° to 180° E or W). The Prime Meridian passes through Greenwich, England.
Ethiopia’s approximate location: Latitude $3°\text{N}$ to $15°\text{N}$, Longitude $33°\text{E}$ to $48°\text{E}$.
Addis Ababa’s approximate coordinates: $9°\text{N}$, $38.7°\text{E}$.
B. Projected Coordinate Systems (Map Projections)
The Earth is a sphere (approximately), but maps are flat. When we convert the curved Earth surface to a flat map, we use a map projection. This process always causes some distortion in one or more of the following:
- Shape (angles between lines)
- Area (size of features)
- Distance (distance between points)
- Direction (bearing between points)
Types of Map Projections:
- Conformal projections: Preserve shape (angles). Distort area. Example: Mercator projection.
- Equal-area (equivalent) projections: Preserve area. Distort shape. Example: Albers projection.
- Equidistant projections: Preserve distance from a central point. Example: Azimuthal equidistant.
- Compromise projections: Try to balance all distortions. Example: Robinson projection.
• Latitude = north-south position (0° at Equator, 90° at poles).
• Longitude = east-west position (0° at Prime Meridian, 180° at International Date Line).
• A map projection converts the curved Earth to a flat map.
• All projections cause distortion in shape, area, distance, or direction.
• Conformal = correct shape. Equal-area = correct area. Equidistant = correct distance.
• Choose projection based on map PURPOSE.
8.2.5 Map Scale
Map scale is the ratio between the distance on a map and the actual distance on the ground. It tells us how much the real world has been reduced to fit on the map.
Three ways to express scale:
1. Representative Fraction (RF):
Both distances must be in the SAME units. Example: 1:50,000 means 1 unit on the map = 50,000 units on the ground.
2. Statement Scale:
Written in words. Example: “One centimeter on the map represents half a kilometer on the ground.”
3. Linear (Graphic) Scale:
A line drawn on the map divided into equal parts, each part representing a specific ground distance.
Worked Example 1: Finding RF from map and ground distances
The distance between two towns on a map is 8 cm. The actual ground distance is 40 km. Find the RF.
Worked Example 2: Finding ground distance from RF and map distance
The RF of a map is 1:100,000. The distance between two points on the map is 5 cm. What is the actual ground distance?
Worked Example 3: Finding map distance from RF and ground distance
The RF of a map is 1:250,000. The actual distance between two villages is 15 km. What distance will separate them on the map?
Worked Example 4: Converting RF to statement scale
Convert RF 1:200,000 to a statement scale.
Practice: A map has RF 1:75,000. Convert this to a statement scale. (Try before looking at the answer!)
Large Scale vs. Small Scale Maps:
| Feature | Large Scale | Small Scale |
|---|---|---|
| RF Example | 1:10,000, 1:25,000 | 1:250,000, 1:1,000,000 |
| Shows | Small area in detail | Large area with less detail |
| Features | More features visible | Fewer features, generalized |
| Use | Cadastral maps, city plans | World maps, country maps |
| Trick | Large RF denominator is SMALL | Small RF denominator is LARGE |
• RF = map distance ÷ ground distance (same units!).
• To convert km to cm: multiply by 100,000 (since 1 km = 100,000 cm).
• Ground distance = map distance × RF denominator.
• Map distance = ground distance ÷ RF denominator.
• Large scale = small denominator = small area, more detail (e.g., 1:10,000).
• Small scale = large denominator = large area, less detail (e.g., 1:1,000,000).
Question 1: The distance between two schools on a map is 3.5 cm. If the RF of the map is 1:50,000, calculate the actual ground distance in kilometers.
Answer:
Ground distance = 3.5 × 50,000 = 175,000 cm
Convert to km: 175,000 ÷ 100,000 = 1.75 km
Question 2: The actual distance between two cities is 120 km. On a map with RF 1:300,000, what is the distance between them on the map?
Answer:
Ground distance in cm = 120 × 100,000 = 12,000,000 cm
Map distance = 12,000,000 ÷ 300,000 = 40 cm
Question 3: A map shows a road that is 12 cm long. The actual road length is 6 km. Calculate the RF of the map and convert it to a statement scale.
Answer:
Ground distance in cm = 6 × 100,000 = 600,000 cm
RF = 12 / 600,000 = 1/50,000
RF = 1:50,000
Statement scale: “One centimeter on the map represents 0.5 kilometers (or 500 meters) on the ground.”
8.2.6 Map Elements
A properly made map must include certain essential elements. Without these, the map is incomplete and may be difficult to understand.
- Title: Tells what the map shows. Should be clear and specific. Example: “Land Use Map of Hawassa City, 2023.”
- Scale: Shows the relationship between map distance and ground distance. Can be RF, statement, or linear scale.
- Legend (Key): Explains the symbols, colors, and patterns used on the map.
- North Arrow: Shows the direction of north. Helps the reader orient the map.
- Source: States where the data came from. Example: “Source: Ethiopian Mapping Agency, 2022.”
- Date: Shows when the map was created or when the data was collected.
- Border (Neat Line): A frame around the map area.
- Coordinate Grid: Lines of latitude and longitude (or other grid system) for locating places.
- Inset Map: A small secondary map showing the location of the main map area within a larger region (optional but helpful).
• Every map MUST have: Title, Scale, Legend, North Arrow.
• Additional important elements: Source, Date, Border, Grid.
• The legend explains ALL symbols, colors, and patterns used.
• Without a scale, the map distances are meaningless.
• Without a legend, the symbols cannot be understood.
Question 1: A student creates a map of soil types in their district but forgets to include a legend. Explain why this is a serious problem.
Answer: Without a legend, the map is essentially useless because: (1) The reader cannot tell which color or pattern represents which soil type — they would just see different colored areas with no meaning, (2) Two maps could use the same colors for different things (e.g., red might mean “clay soil” on one map and “eroded area” on another), (3) The map cannot be interpreted, analyzed, or used for decision-making, (4) It defeats the entire purpose of making the map, which is to communicate information visually. The legend is the “dictionary” of the map.
Question 2: List any five essential elements that a well-prepared geographic map should contain.
Answer: (1) Title — identifies what the map shows, (2) Scale — shows the ratio of map distance to ground distance, (3) Legend/Key — explains symbols, colors, and patterns, (4) North Arrow — shows direction/orientation, (5) Source — states the origin of the data used.
8.2.7 Map Making Process Using GIS
Making a map using GIS involves several steps. Let us go through them one by one:
Step 1: Define the Purpose of the Map
Before you start, ask yourself: What is this map for? Who is the audience? What message should it communicate? For example, is it a map showing population density for a government planning report, or a map of tourist attractions for visitors?
Step 2: Collect and Prepare Data
Gather the spatial and attribute data you need. Data sources include:
- Field surveys (GPS data collection)
- Satellite images and aerial photographs
- Existing maps (digitized)
- Government databases (census, land use records)
- Open data sources (OpenStreetMap, etc.)
The data must be checked for accuracy, completeness, and consistency. This is called data cleaning.
Step 3: Data Entry and Digitization
If data is in paper form, it must be converted to digital format. This process is called digitization. There are two methods:
- Heads-up digitizing: Tracing features on screen over a scanned map or satellite image.
- Tablet digitizing: Using a digitizer tablet and puck to trace features from a paper map.
During digitization, you create vector features (points, lines, polygons) and enter their attributes in the attribute table.
Step 4: Data Analysis (Optional)
Depending on your purpose, you may need to analyze the data. Common GIS analysis operations include:
- Buffer: Creating a zone around a feature at a specified distance. Example: Creating a 2 km buffer around a river to show the protected area.
- Overlay: Combining two or more layers to find relationships. Example: Overlaying a soil type layer with a land use layer.
- Query: Selecting features based on their attributes. Example: Selecting all schools with more than 500 students.
- Clip: Cutting one layer using the boundary of another layer.
- Reclassification: Grouping data values into categories.
Step 5: Map Design and Layout
This is where you compose the final map by arranging all elements:
- Add the map frame (the main mapped area)
- Choose appropriate colors, symbols, and patterns
- Add the title, legend, scale bar, north arrow
- Add source, date, and other annotations
- Ensure the design is clean, readable, and not cluttered
Step 6: Review and Output
Review the map for errors. Check that all elements are present and correct. Then export or print the map in the required format (PDF, image file, printed copy).
• 6 main steps: Define purpose → Collect/prepare data → Digitize → Analyze → Design layout → Review/output.
• Digitization = converting paper maps to digital vector data.
• Buffer = zone around a feature at a set distance.
• Overlay = combining layers to find relationships.
• Query = selecting features by their attributes.
• Every map must have: title, scale, legend, north arrow, source, date.
Question 1: Explain the process of digitization in GIS. Why is it important?
Answer: Digitization is the process of converting analog (paper) map data into digital format that can be used in a GIS. It involves tracing geographic features (points, lines, polygons) from a scanned paper map or satellite image on a computer screen (heads-up digitizing) or using a digitizer tablet. As each feature is traced, its attribute data is entered into the attribute table. Digitization is important because: (1) Paper maps cannot be easily analyzed, queried, or overlaid — digital data can, (2) Digital data can be stored, copied, shared, and backed up easily, (3) GIS analysis functions (buffer, overlay, query) require digital data, (4) Digital maps can be updated more easily than paper maps, (5) It allows integration of data from different sources into a single system.
Question 2: A health officer wants to create a map showing all health facilities within 5 km of each settlement in a district. Which GIS analysis operation would they use? Explain how it works.
Answer: They would use the buffer operation. Here is how it would work: (1) First, they would load the settlement layer (points) and the health facility layer (points) into the GIS, (2) They would apply a buffer of 5 km around each settlement point, creating circular zones (or zones along roads if using network analysis), (3) Then they would use a spatial join or overlay to identify which health facility points fall within each 5 km buffer zone, (4) The result would show, for each settlement, which health facilities are within 5 km and which settlements lack access to a health facility within 5 km. This is a common application of GIS in public health planning.
8.2.8 Applications of GIS in Geography
GIS has many applications in geography and everyday life:
- Urban and Regional Planning: Planning land use, roads, water supply, and infrastructure.
- Agriculture: Mapping soil types, crop distribution, and planning irrigation.
- Environmental Management: Monitoring deforestation, erosion, pollution, and wildlife habitats.
- Disaster Management: Mapping flood zones, earthquake risk areas, and planning evacuation routes.
- Transportation: Planning road networks, analyzing traffic patterns, and navigation.
- Health: Mapping disease distribution, locating health facilities, and planning health services.
- Education: Mapping school locations, analyzing access to education, and planning new schools.
- Water Resources: Mapping watershed boundaries, groundwater potential, and water quality.
Can you think of how GIS could be used in your own community? For example, could it help plan where to build a new school or clinic?
• GIS is used in almost every field that involves spatial decision-making.
• Key application areas: urban planning, agriculture, environment, disaster management, health, transport.
• GIS helps make BETTER decisions because it shows spatial patterns and relationships.
• In Ethiopia, GIS is used by agencies like CSA (census), MoA (agriculture), and EPA (environment).
In this unit, you learned about: (1) Geographical enquiry — the systematic process of asking questions, formulating hypotheses, collecting data (primary and secondary), analyzing data, and drawing conclusions. (2) Sampling techniques — random, systematic, stratified, and cluster sampling, plus sample size calculation. (3) GIS — a computer-based system for managing, analyzing, and displaying geographic data, with five components (hardware, software, data, people, methods). (4) GIS data types — vector (point, line, polygon) and raster (grid of cells), plus attribute data. (5) Map projections and coordinate systems. (6) Map scale calculations. (7) Map elements and the map making process. These skills are essential for any geography student and have wide real-world applications.
Revision Notes — Exam Focus
Important Definitions
Hypothesis: A tentative, testable statement predicting the expected outcome of a research study or the relationship between variables.
Null Hypothesis (H₀): A statement that there is no significant relationship or difference between variables. It is the hypothesis that is tested statistically.
Alternative Hypothesis (H₁): A statement that there IS a significant relationship or difference between variables. It is accepted if the null hypothesis is rejected.
Primary Data: Data collected firsthand by the researcher through fieldwork, experiments, surveys, or observations.
Secondary Data: Data that already exists and was collected by someone else for a different purpose.
Population: The entire group of individuals or items that the researcher wants to study.
Sample: A subset of the population selected to participate in the study.
Sampling: The process of selecting a subset of the population to represent the whole population.
GIS: Geographic Information System — a computer-based system for capturing, storing, analyzing, and displaying spatial (geographic) data.
Vector Data: Spatial data represented as points, lines, and polygons to represent discrete features.
Raster Data: Spatial data represented as a grid of cells (pixels), each containing a value, used for continuous surfaces.
Attribute Data: Non-spatial descriptive data about geographic features, stored in attribute tables linked to spatial features.
Map Projection: A mathematical method of transforming the curved surface of the Earth onto a flat map surface, which always involves some distortion.
Map Scale: The ratio between a distance on a map and the corresponding actual distance on the ground.
Digitization: The process of converting analog (paper) map data into digital format for use in GIS.
Buffer: A GIS operation that creates a zone of specified distance around a geographic feature.
Key Formulas
1. Sample Size Formula:
Where: n = sample size, N = population size, e = margin of error
2. Systematic Sampling Interval:
Where: k = sampling interval, N = population size, n = desired sample size
3. Map Scale (Representative Fraction):
(Both in the same units!)
4. Ground Distance from RF:
5. Map Distance from RF:
6. Conversion Factor:
7. Mean:
8. Range:
Common Mistakes to Avoid
1. NOT converting units before calculating RF.
❌ Wrong: RF = 5 cm / 10 km (different units!)
✅ Correct: RF = 5 cm / 1,000,000 cm = 1:200,000
2. Confusing large scale and small scale.
❌ Wrong: 1:1,000,000 is large scale
✅ Correct: 1:1,000,000 is small scale (large denominator = small scale)
3. Writing a hypothesis that is not testable.
❌ Wrong: “Deforestation is harmful to the environment.” (value judgment)
✅ Correct: “Areas with higher deforestation rates have higher soil erosion rates.”
4. Forgetting map elements.
❌ Wrong: Drawing a map without scale, legend, or north arrow
✅ Correct: Include title, scale, legend, north arrow, source, and date
5. Confusing vector and raster data.
❌ Wrong: Using points to show temperature distribution
✅ Correct: Temperature = raster (continuous); Roads = vector (lines)
6. Using only one data collection method.
❌ Wrong: Relying only on a questionnaire
✅ Correct: Use triangulation — combine questionnaire + observation + interview
7. Forgetting to state units in scale calculations.
❌ Wrong: “The ground distance is 500,000”
✅ Correct: “The ground distance is 500,000 cm = 5 km”
8. Confusing population and sample.
❌ Wrong: “The population of my study is 50 farmers”
✅ Correct: “The population is 2,000 farmers; the sample is 50 farmers”
Quick Comparison Tables
| Sampling Methods Comparison | |
|---|---|
| Method | Key Feature |
| Simple Random | Every member has equal chance; use random numbers |
| Systematic | Select every k-th member; k = N/n |
| Stratified | Divide into strata, sample from each proportionally |
| Cluster | Select entire clusters randomly, study all members |
| Convenience | Select whoever is easily available (not scientific) |
| Purposive | Researcher selects based on judgment |
| Map Projection Types | ||
|---|---|---|
| Type | Preserves | Distorts |
| Conformal | Shape (angles) | Area |
| Equal-area | Area | Shape |
| Equidistant | Distance from center | Area and shape |
| Compromise | Balance of all | All somewhat |
| Vector vs. Raster | |
|---|---|
| Vector | Raster |
| Points, lines, polygons | Grid of cells (pixels) |
| Discrete features | Continuous surfaces |
| Smaller file size | Larger file size |
| Zoom without quality loss | Pixelation at high zoom |
| Roads, buildings, boundaries | Satellite images, DEM, temperature |
Simple Examples for Quick Recall
Challenge Exam Questions
Multiple Choice Questions
Q1. Which of the following is the BEST example of a testable hypothesis in geographical research?
(a) Soil erosion is a serious problem in Ethiopia.
(b) Areas with steeper slopes have higher rates of soil erosion than areas with gentle slopes.
(c) All farmers should practice soil conservation.
(d) Soil erosion is caused by rain.
Answer: (b) This is the best hypothesis because it is specific, testable, and states a clear relationship between two variables (slope steepness and erosion rate). Option (a) is a value judgment, not testable. Option (c) is a recommendation, not a hypothesis. Option (d) is too vague — it does not specify a measurable relationship.
Q2. In a GIS, which type of data would be MOST appropriate to represent a network of roads in a city?
(a) Point data
(b) Line data
(c) Polygon data
(d) Raster data
Answer: (b) Line data. Roads have length but no significant width on a map, so they are represented as lines in vector data. Points would only show intersections or endpoints. Polygons would show areas, not linear networks. While raster data CAN show roads, it is not the MOST appropriate — vector lines are the standard and most efficient representation for road networks.
Q3. A map with RF 1:10,000 is best described as:
(a) A small scale map showing a large area
(b) A large scale map showing a small area in detail
(c) A medium scale map
(d) A world map
Answer: (b) A large scale map showing a small area in detail. The RF denominator (10,000) is small, which means it is a large scale map. Large scale maps show small areas with great detail — for example, a map of a single neighborhood or a farm. A world map would have a much larger denominator like 1:50,000,000.
Q4. Which sampling method divides the population into subgroups and then randomly samples from each subgroup proportionally?
(a) Simple random sampling
(b) Cluster sampling
(c) Stratified sampling
(d) Systematic sampling
Answer: (c) Stratified sampling. Stratified sampling involves dividing the population into homogeneous subgroups (strata) based on a characteristic (e.g., age, income, land size) and then randomly selecting a proportional sample from each stratum. Simple random sampling does not involve subgroups. Cluster sampling selects entire clusters. Systematic sampling selects every k-th member.
Q5. Which of the following map projections is BEST for showing the correct sizes (areas) of countries?
(a) Mercator projection
(b) Albers equal-area projection
(c) Azimuthal equidistant projection
(d) Gnomonic projection
Answer: (b) Albers equal-area projection. Equal-area (equivalent) projections preserve area — the sizes of features on the map are proportional to their actual sizes on the ground. The Mercator projection is conformal (preserves shape) but greatly distorts area (making Greenland appear as large as Africa, when Africa is actually 14 times larger). The Azimuthal equidistant preserves distance from a center point, and the Gnomonic is used for great circle navigation.
Fill in the Blank
Q6. The process of converting paper map data into digital format for use in GIS is called __________.
Answer: Digitization. Digitization is the process of tracing geographic features from a scanned paper map or aerial photograph on a computer screen to create digital vector data (points, lines, polygons) that can be stored, analyzed, and displayed in a GIS.
Q7. In GIS, the descriptive information about a geographic feature (such as its name, population, or type) is called __________ data.
Answer: Attribute. Attribute data is the non-spatial descriptive information linked to each spatial feature in a GIS. It is stored in an attribute table and connected to the spatial data through a unique identifier. For example, a point representing a school (spatial data) would have attribute data such as the school’s name, number of students, and type (public/private).
Q8. A __________ hypothesis states that there is NO significant relationship between the variables being studied.
Answer: Null. The null hypothesis (H₀) states that there is no significant relationship or difference between variables. It is the starting point in statistical testing — the researcher collects data and tests whether there is enough evidence to reject the null hypothesis. If rejected, the alternative hypothesis (H₁) is accepted.
Q9. The GIS operation that creates a zone of specified distance around a geographic feature is called __________.
Answer: Buffer. A buffer creates a zone (area) at a specified distance around a point, line, or polygon. For example, a 500-meter buffer around a river creates a polygon showing all land within 500 meters of the river. Buffers are widely used in planning (e.g., buffer zones around protected areas, distance analysis for facility locations).
Q10. The point on the Earth’s surface directly above the focus of an earthquake is called the __________.
Answer: Epicenter. The focus (or hypocenter) is the point inside the Earth where the earthquake originates — where the rock actually breaks. The epicenter is the point on the Earth’s surface directly above the focus. Seismic waves spread outward from the focus, but the epicenter is what is reported in the news because it identifies the surface location of the earthquake.
Short Answer Questions
Q11. Differentiate between quantitative data and qualitative data. Give one example of each that could be collected in a geographical field study of soil types.
Answer:
Quantitative data is expressed in numbers and can be measured or counted. It is analyzed using statistical methods. Example in soil study: Soil pH value (e.g., pH 6.5), soil depth in centimeters (e.g., 45 cm), or percentage of organic matter (e.g., 3.2%).
Qualitative data is expressed in words and describes qualities, characteristics, or properties. It is analyzed using themes and patterns. Example in soil study: Soil color description (e.g., “dark reddish-brown”), soil texture description (e.g., “sandy loam, gritty to touch”), or drainage condition description (e.g., “well-drained”).
Q12. Why is it important to include a scale on a map? What would happen if a map had no scale?
Answer: A scale is essential because it tells the map reader the relationship between distances on the map and actual distances on the ground. Without a scale: (1) The reader cannot determine how far apart places actually are — a 5 cm gap on the map could mean 500 meters or 500 kilometers, (2) The reader cannot calculate actual areas of regions shown on the map, (3) The map cannot be used for navigation, planning, or any practical purpose that requires distance measurement, (4) Different users cannot compare the map with other maps. In short, without a scale, the map loses its most fundamental quantitative function.
Q13. Explain what triangulation means in the context of geographical data collection. Why is it recommended?
Answer: Triangulation in geographical data collection means using multiple methods, multiple data sources, or multiple researchers to study the same research problem. For example, studying deforestation using satellite image analysis AND field observation AND interviews with local people. It is recommended because: (1) It increases the reliability and validity of research findings — conclusions are stronger when supported by multiple sources of evidence, (2) It allows cross-verification — if three methods all point to the same conclusion, confidence in that conclusion increases, (3) It compensates for the weaknesses of individual methods — what one method misses, another may capture, (4) It reduces researcher bias, as no single method dominates the findings.
Q14. List four essential elements that every geographic map must have and briefly explain the purpose of each.
Answer:
(1) Title: Tells the reader what the map shows (topic, area, and time period). Without it, the map’s purpose is unknown.
(2) Scale: Shows the ratio between map distance and ground distance. Without it, distances and areas cannot be measured.
(3) Legend (Key): Explains the meaning of all symbols, colors, and patterns used on the map. Without it, the map features cannot be interpreted.
(4) North Arrow: Shows the direction of geographic north, allowing the reader to orient the map correctly and understand directions between features.
(Additional acceptable elements: Source, Date, Border, Coordinate Grid.)
Step-by-Step Calculation Questions
Q15. A geography student wants to survey farmers in a district with a total population of 1,200 farmers. Using a margin of error of 5%, calculate the required sample size using the formula n = N/(1+Ne²). Show all steps.
Answer:
Given: N = 1,200, e = 0.05
Step 1: Calculate e²
$$e^2 = (0.05)^2 = 0.0025$$
Step 2: Calculate N × e²
$$N \cdot e^2 = 1,200 \times 0.0025 = 3$$
Step 3: Calculate the denominator (1 + Ne²)
$$1 + Ne^2 = 1 + 3 = 4$$
Step 4: Calculate n
$$n = \frac{N}{1 + Ne^2} = \frac{1,200}{4} = 300$$
The required sample size is 300 farmers.
Q16. On a topographic map with RF 1:75,000, the measured distance between two points is 12.5 cm. Calculate the actual ground distance in (a) centimeters, (b) meters, and (c) kilometers. Show all steps.
Answer:
Given: RF = 1:75,000, Map distance = 12.5 cm
(a) Ground distance in centimeters:
$$\text{Ground distance} = 12.5 \times 75,000 = 937,500 \text{ cm}$$
(b) Ground distance in meters:
$$937,500 \text{ cm} \div 100 = 9,375 \text{ m}$$
(c) Ground distance in kilometers:
$$9,375 \text{ m} \div 1,000 = 9.375 \text{ km}$$
The actual ground distance is 937,500 cm = 9,375 m = 9.375 km.
Q17. Two towns are 45 km apart in reality. On a map, they appear 9 cm apart. (a) Calculate the RF of the map. (b) Express this as a statement scale. (c) Is this a large scale or small scale map? Explain.
Answer:
Given: Ground distance = 45 km, Map distance = 9 cm
(a) Calculate RF:
First convert ground distance to cm:
$$45 \text{ km} = 45 \times 100,000 = 4,500,000 \text{ cm}$$
$$\text{RF} = \frac{9}{4,500,000} = \frac{1}{500,000}$$
RF = 1:500,000
(b) Statement scale:
$$1 \text{ cm on map} = 500,000 \text{ cm on ground} = 5 \text{ km}$$
“One centimeter on the map represents 5 kilometers on the ground.”
(c) Large or small scale?
This is a small scale map. The RF denominator (500,000) is large, meaning the map shows a large area with less detail. It would likely show a regional or district-level view, not detailed features of individual neighborhoods.
Q18. A researcher plans to use systematic sampling to select 80 households from a village of 800 households. (a) Calculate the sampling interval (k). (b) If the random start number is 6, list the first five households to be selected. (c) What is the last household number that will be selected?
Answer:
Given: N = 800, n = 80, random start = 6
(a) Sampling interval (k):
$$k = \frac{N}{n} = \frac{800}{80} = 10$$
k = 10 (select every 10th household)
(b) First five households:
Start at 6, then add k (10) each time:
6th, 16th, 26th, 36th, 46th
(c) Last household:
The sequence is: 6, 16, 26, 36, …, continuing until we reach the 80th selection.
Formula: Last number = random start + (n-1) × k
$$= 6 + (80-1) \times 10 = 6 + 790 = 796$$
The last household selected is number 796.
Q19. The following table shows monthly rainfall (in mm) recorded at a weather station for six months. Calculate the mean, median, and range.
| Month | Jan | Feb | Mar | Apr | May | Jun |
|---|---|---|---|---|---|---|
| Rainfall (mm) | 15 | 22 | 45 | 78 | 120 | 95 |
Answer:
Data: 15, 22, 45, 78, 120, 95 (n = 6)
Mean:
$$\bar{x} = \frac{15 + 22 + 45 + 78 + 120 + 95}{6} = \frac{375}{6} = 62.5 \text{ mm}$$
Median:
First arrange in ascending order: 15, 22, 45, 78, 95, 120
Since n = 6 (even), median = average of 3rd and 4th values
$$\text{Median} = \frac{45 + 78}{2} = \frac{123}{2} = 61.5 \text{ mm}$$
Range:
$$\text{Range} = 120 – 15 = 105 \text{ mm}$$
Results: Mean = 62.5 mm, Median = 61.5 mm, Range = 105 mm
Q20. A student is conducting a geographical enquiry on “The effect of altitude on temperature in the Ethiopian highlands.” (a) Write a suitable null hypothesis and alternative hypothesis. (b) Identify two primary data collection methods the student could use. (c) Identify one appropriate method of data presentation and justify your choice.
Answer:
(a) Hypotheses:
H₀ (Null): There is no significant relationship between altitude and temperature in the Ethiopian highlands.
H₁ (Alternative): There is a significant relationship between altitude and temperature in the Ethiopian highlands.
(b) Primary data collection methods:
(1) Measurement: Use a thermometer to measure temperature at different altitude points, and use a GPS or altimeter to record the altitude at each point. This gives direct quantitative data.
(2) Field Observation: Observe and record vegetation types, agricultural crops, and other indicators that vary with temperature at different altitudes. This provides supporting qualitative data.
(Additional acceptable method: Secondary data — obtain temperature and altitude records from meteorological stations.)
(c) Data presentation method:
A scatter plot would be the most appropriate method. A scatter plot shows the relationship between two continuous variables by plotting altitude on the x-axis and temperature on the y-axis. Each point on the plot represents one observation (one location with its altitude and temperature). This allows the reader to: (1) visually see whether a relationship exists, (2) see the direction of the relationship (positive or negative), (3) see the strength of the relationship (how tightly the points cluster), (4) identify any outliers. A line of best fit could also be added. Alternatively, a line graph could work if data is collected along a transect (e.g., going up a mountain slope).
Q21. Explain the difference between a conformal map projection and an equal-area map projection. Give one use case for each in geographical research.
Answer:
Conformal projection preserves the shape of features (angles are correct). However, it distorts area — features near the poles appear much larger than they really are. Example: The Mercator projection.
Use case: Navigation and weather mapping, where accurate shapes and angles are important. For example, sailors use conformal projections because the angles between lines on the map match the actual compass bearings on the ground. Weather forecasters use them because wind direction patterns are shown accurately.
Equal-area projection preserves the relative sizes (areas) of features. However, it distorts shape — features may appear stretched or compressed. Example: The Albers equal-area projection.
Use case: Comparing the sizes of different regions, analyzing land use distribution, or mapping population density by area. For example, if a geographer wants to compare the total area of forest cover in different Ethiopian regions, an equal-area projection ensures that the sizes on the map are proportional to the actual ground areas.
Q22. Describe the five components of a Geographic Information System (GIS). Which component do you consider the most critical, and why?
Answer:
The five components of GIS are:
(1) Hardware: Physical equipment — computers, GPS receivers, digitizers, printers, and storage devices.
(2) Software: Programs like ArcGIS, QGIS, and Google Earth that process, analyze, and display geographic data.
(3) Data: Geographic data (spatial and attribute) — the raw material that the system processes.
(4) People: Trained users who collect data, operate the system, analyze results, and make decisions.
(5) Methods: Procedures, rules, and analytical models that guide how the GIS is used.
Most critical component: Data. Data is the most critical component because: (1) GIS is fundamentally a data processing system — without data, the hardware and software have nothing to process, making the system useless, (2) The quality of output (maps, analysis results, decisions) depends entirely on the quality of input data (“garbage in, garbage out”), (3) Data is the most expensive and time-consuming component to obtain — collecting, cleaning, and organizing data often takes more time and money than purchasing hardware or software, (4) All other components exist to serve the data — hardware runs the software, software processes the data, people manage the data, and methods standardize how data is handled.