Acoustics Unpacked

Cluster Sampling Analysis

Example

Consider a data set containing fish density (number of fish per m²) for YAO smelt collected in July 2007 at the south end of the main lake of Lake Champlain (the data set can be downloaded as sml.csv). The data are in comma-delimited format with column headers (the program skips the first line when reading the file), with transect number in column 1 (integer format) and fish density in column 2. The area covered in this example is assumed to be 100 km², or 100,000,000 m² (1.0e8 m² in scientific notation). Area is needed to expand the density estimates in the file to total abundance for the entire survey area. The total survey area typically comes from a different source than the survey observations in the file, so here it is entered by hand into the table before the file data are used for the summary calculations. The table below provides the expected results:

Variable    Value
n           8
mean m      41.25
mean rho    0.088
s2clu       0.5744
SE(rho)     0.0065
Area        1.0e8
N           8.8e6
SE(N)       6.5e5
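The file layout described above (a header line, transect number in column 1, density in column 2) can be read with a few lines of Python. This is a minimal sketch, not the program referred to in the text: the function name, the header labels, and the sample rows are all made up for illustration and are not the contents of sml.csv.

```python
import csv
import io
from collections import defaultdict


def read_transect_densities(handle):
    """Group bin densities by transect number from a two-column CSV."""
    reader = csv.reader(handle)
    next(reader)  # skip the header line, as the text describes
    densities = defaultdict(list)
    for row in reader:
        if not row:
            continue  # tolerate blank lines
        transect = int(row[0])       # column 1: transect number (integer)
        density = float(row[1])      # column 2: fish density
        densities[transect].append(density)
    return dict(densities)


# Hypothetical rows for illustration only; NOT the sml.csv values.
sample = io.StringIO("transect,density\n1,0.10\n1,0.20\n2,0.05\n")
by_transect = read_transect_densities(sample)
```

For a real file, the same function could be called with `open("sml.csv", newline="")` in place of the in-memory sample.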

Theory

Cluster sampling may be used for systematic or random parallel transects or for zig-zag transects using only parallel zigs OR parallel zags.

Cluster sampling is an appropriate design and analysis method to consider for acoustics because observations are typically taken in clusters along a transect rather than as, say, independent 1-minute sample units randomly scattered throughout the population. The clustered nature of the samples often requires additional attention to the type of analysis used, so that the most can be made of the samples collected. A major advantage of this method is that it weights estimates according to transect length. Since transect lengths are seldom identical, this is the recommended method for acoustic surveys in general when geostatistics is not being used (see below).

In an acoustic example of cluster sampling:

Transects are clusters, and

Horizontal bins are elements within clusters.

The first step is to compute an aggregate density estimate P_i across all the elements in each cluster i as follows:

$$P_i = \sum_{j=1}^{m_i} \rho_j \qquad [42]$$

where:
m_i is the number of elements (bins) in cluster (transect) i;
ρ_j is density in horizontal bin j (#m^-2).

Notice that P_i is also in units of #m^-2, but this is misleading: P_i is the sum of all bin densities and is therefore a function of the number of bins. Multiplying this estimate by the average area per bin would give the total number per transect, which is the quantity typically used in textbooks presenting cluster analysis. We leave that extra calculation out here because, in the end, it cancels out.
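The aggregation step can be sketched in Python as follows. The function name and the example densities are hypothetical, chosen only to show that each transect yields a pair (P_i, m_i): the sum of its bin densities and its number of bins.

```python
def aggregate_by_transect(by_transect):
    """Return {transect: (P_i, m_i)} from {transect: [bin densities]}."""
    totals = {}
    for transect, bins in by_transect.items():
        P_i = sum(bins)   # aggregate density: sum over bins in the transect
        m_i = len(bins)   # number of elements (bins) in the transect
        totals[transect] = (P_i, m_i)
    return totals


# Hypothetical bin densities (#m^-2) for two transects of unequal length.
example = {1: [0.10, 0.20, 0.30], 2: [0.05, 0.15]}
totals = aggregate_by_transect(example)
```

Here transect 1 has three bins and transect 2 has two, so the two P_i values are not directly comparable until they are related to m_i.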

We can compute the average density:

$$\bar{\rho} = \frac{\sum_{i=1}^{n} P_i}{\sum_{i=1}^{n} m_i} \qquad [43]$$

where:
n is the number of clusters (transects) in the sample;
P_i is the aggregate density observed in cluster i, and;
m_i is the number of elements (bins) in cluster i, with i = 1,…, n.
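A short sketch may help show why this estimate weights transects by length. It assumes the average density is the ratio of summed aggregate densities to summed bin counts, and compares it with a plain average of per-transect means. The function name and the numbers are illustrative only.

```python
def mean_density(P, m):
    """Ratio estimate of mean density: sum of P_i over sum of m_i."""
    return sum(P) / sum(m)


# Hypothetical aggregates: transect 1 has 3 bins, transect 2 has 2 bins.
P = [0.6, 0.2]
m = [3, 2]

rho_bar = mean_density(P, m)  # weighted by transect length

# For contrast: averaging the per-transect means treats a short
# transect the same as a long one.
unweighted = sum(p / mi for p, mi in zip(P, m)) / len(P)
```

With these made-up values the ratio estimate is about 0.16, while the unweighted average of transect means is about 0.15. The difference arises because the longer transect contributes more bins to the ratio estimate.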

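The cluster variance (s²_clu) and the standard error of the estimated average number per bin (SE(ρ̄)) may then be found:

$$s^2_{clu} = \frac{\sum_{i=1}^{n} \left(P_i - \bar{\rho}\, m_i\right)^2}{n - 1} \qquad [44]$$

$$SE(\bar{\rho}) = \sqrt{\frac{s^2_{clu}}{n\, \bar{m}^2}} \qquad [45]$$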

where:
P_i is the aggregate density in cluster i;
ρ̄ is the average number per bin over all clusters;
m_i is the number of elements (bins) in cluster i, i = 1,…, n;
n is the number of clusters in the simple random sample; and
m̄ is the estimated average number of elements (bins) per cluster (transect), such that

$$\bar{m} = \frac{1}{n} \sum_{i=1}^{n} m_i \qquad [46]$$
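The variance and standard error can be sketched as below. This assumes the usual ratio-estimator form: squared deviations of P_i from ρ̄·m_i divided by n − 1, then divided by n·m̄² under the square root, with no finite-population correction. That form is consistent with the example table, where s2clu = 0.5744, n = 8 and mean m = 41.25 give SE(rho) ≈ 0.0065. The function name and input values are hypothetical.

```python
import math


def cluster_variance_and_se(P, m):
    """Return (s2_clu, SE of mean density) for cluster aggregates P and sizes m."""
    n = len(P)
    rho_bar = sum(P) / sum(m)   # mean density (ratio estimate)
    m_bar = sum(m) / n          # average number of bins per transect
    # Squared deviations of each aggregate from its expected value rho_bar * m_i
    s2_clu = sum((p - rho_bar * mi) ** 2 for p, mi in zip(P, m)) / (n - 1)
    # Assumed form: no finite-population correction
    se = math.sqrt(s2_clu / (n * m_bar ** 2))
    return s2_clu, se


# Hypothetical aggregates for three transects of 3, 2 and 4 bins.
P = [0.6, 0.2, 0.5]
m = [3, 2, 4]
s2_clu, se_rho = cluster_variance_and_se(P, m)
```

For these made-up inputs the cluster variance is roughly 0.0209 and the standard error roughly 0.0278.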

Cluster sampling estimates may be expanded to total population abundance (N) by simply multiplying average density by area:

$$N = A\, \bar{\rho} \qquad [47]$$

where:
A is the total area;
ρ̄ is the average density (#m^-2 area or #m^-3 volume).

The standard error of the population abundance is:

$$SE(N) = A \cdot SE(\bar{\rho}) \qquad [48]$$

where, again, SE(ρ̄) is the standard error of the estimated mean density derived from the cluster sampling method described above.
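As a check on the example, the tabulated values can be pushed through the expansion step. The sketch below assumes the standard error of the mean density is the square root of s2clu / (n × mean m²), with no finite-population correction. The variable names are illustrative, and the inputs are the rounded values from the example table, so the results match the table only to rounding.

```python
import math

# Rounded values taken from the example table
n = 8              # number of transects
mean_m = 41.25     # average number of bins per transect
mean_rho = 0.088   # mean density (#m^-2)
s2_clu = 0.5744    # cluster variance
area = 1.0e8       # survey area (m^2), entered by hand

# Standard error of mean density (assumed form, no finite-population correction)
se_rho = math.sqrt(s2_clu / (n * mean_m ** 2))

# Expand density to abundance by multiplying by area
abundance = area * mean_rho
se_abundance = area * se_rho

print(f"SE(rho) = {se_rho:.4f}")
print(f"N       = {abundance:.2e}")
print(f"SE(N)   = {se_abundance:.2e}")
```

The computed values agree with the table to the precision shown there: SE(rho) near 0.0065, N near 8.8e6 and SE(N) near 6.5e5.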