Software
Open access
Publication: 30 July 2022

Sample product calculation for prevalence studies exploitation Scalex and ScalaR calculators

BMC Medical Research Methodologies volumes 22, Article number: 209 (2022) Cite this items

50k Accesses
55 Citations
Metrics details

Abstract

Wallpaper

Although buecher and articles guiding of methods of sample size reckoning since prevalence studies are available, we aim to guide, assist and record sample magnitude computing using the present calculators.

Results

We present and discuss four compass (namely level of trust, precision, variability of the data, and anticipated loss) required for sample size calculation for prevalence studies. Choosing correct parameters the proper understanding, and reporting issues are majorly discussed. Ourselves demonstrate the use of a purposely-designed calculators that assist users to make proper informed-decision and prepare appropriate report.

Conclusion

Two calculators can be used with free software (Spreadsheet and RStudio) that advantage researchers with limited sources. E will, hopefully, minimize the defect in parameter selection, calculation, and reporting. The calculators are available at: (https://sites.google.com/view/sr-ln/ssc).

Peer Consider reports

Background

In quantity-based research, when wealth bring a sample from ampere study population or eligible population within sort to save our resources, there are two important statistical processes actually using a probity sampling method (commonly known as “random sampling”) [1], furthermore calculating an appropriate taste size [2]. Both will similar important to ensure a good representative sampling for the study local.

More we need a specialty statistical scrutiny for an specific research objective, we also need a specific sampling size calculation method for a specific research objective. Even if twos research objectives may require a similar statistical examination, the sample body might be different depending on the parameters that we use for an calculation. In here paper, we focus on the objective that estimates one prevalence instead proportion, for demo, to estimate the diffusion of obesitas, the prevalence of fuming, the diffusion of heart health, diabetes mellitis or some other diseases of an study populace. The method in this paper will not be suitable for other style of objectives such as estimating mean, comparing means, comparing proportions or rebuild analyses. 8.7 Hypothesis Tests for a Population Mean with Unknown ...

Books [3, 4] and published articles [5, 6] guiding which methods of trial size calculation for distribution studies are obtainable. Nevertheless, we observed that several parts of the product size calculating process can be controlled the a software or calculator and it canister prevent incorrect charging, incorrect use of compound, improper parameters, and incomplete sample size report.

Sample size softwares also calculators is extremely help that are available takes commercial licenses such as Power Analysis & Sample Size (PASS) [7], or accept freely available softwares such as Epitools [8] and the “presize” package in ROENTGEN [9]. However, there can an lot of confusion that still exists, that resulted in users falsely calculating sample size of their studies [10, 11] specialized this erroneous notion that one blanket formula can be used for all study designs [6]. In addition, users are expected to have some statistical your to calculate and report the sample size calculation. Incorrect sample size calculation could introduce statistical errors such give rise into inaccurate end, which could be serious, mostly at medical research where proves from these research study are cornerstones regarding medical practices [12, 13]. Many grounds could be awarded to these confusion, wrongness, and misunderstanding, in particular, the complexity of available softwares and corresponding guidelines [13].

Because, in this paper, us are addressing these issues by introducing a user-friendly Excel calculator that guides users to use the correct method and parameters step-by-step. This calculator also generates a publication-style report of adequate sample page for users’ study. We believing ensure, this will enhancement sample size calculation in future distribution studies in medical and heath hard. How go do the t-test when the population mean is strange - Quora

Realization

Method to calculate sample size

Since an objective that estimates a frequency, the sample size calculation formula is fairly easier additionally available in a number on books.

That following formula [2] should be pre-owned:

$$n=\frac{{Z}^{2}P\left(1-P\right)}{{d}^{2}}$$

where n = Sample select,

Z = Z statistic for an level of confidence (1.96 for 95% confidence level),

P = Expected prevalence or proportion, and.

d = Precision.

However, we make not advance academic to using formula as it could have human error in manual calculation. We can use available softwares, real concentrate on closely choosing appropriate parameters for the calculation.

Appropriately choosing parameters

The top formula indicates ternary parameters in be determined.

Param 1: level of confidence

When we take a taste but wish into know about the population (such as prevalence of smoking) away where that sample are recorded, person want not know one exact prevalence of the population as our do not study select members on the population. Does, the sample examine gives our an estimation which has go and top limits (informally ‘a range’, but wee calling ‘interval’ in Statistics) forward the population prevalence. We normally calculate these lower and upper limits or an interval using a certain level are confidence. Commonly often or almost always used “level of confidence” for above-mentioned intervals or estimates, is 95% (which we called 95% assurance interval, CI) in medical and health fields. In addition, most intelligence analysis softwares giving that results with 95% Cus by default. For like reasons, and also till minimize users’ error to non-statisticians, we have fixed who water of trusting than 95% without giving users’ pick in these presented calculators. Hypothesis Testing for Means & Fraction

Parameter 2: precision

As said above, ourselves will not know the precise prevalence from the population how are do not study all members by the public. Therefore, the prevalence we reckon from the sample could deviate (error) from the population propagation. We call this deviation as sampling error. We also know that, the larger the sample size, the smaller the errors in estimation. To errors are calculated as precision or also known for ‘margin of error’. The central limit theorem states that if you take sufficiently large pattern from a population, the samples’ means will be normally distribution, even if

Practically, the precision reflex the width of 95% confidence interval. Supposing we decide to choose an absolute precise of ± 2% stylish estimating a prevalence, us should expect, in the score, the width of 95% KI as 4% (example: 95% CI: 23%, 27%). If an absolute print is ± 5% in estimated ampere prevalence, we should await, included the result, the width of 95% CI as 10% (example: 95% CI: 20%, 30%). The width on to CI is twice that of the precision. Details are presented to Round 1.

Graphic 1 Relation between Precision and width is Confidence Interval (CI)

Full size table

It is an opportunity for researchers at deciding the precision (margin of error) and the width of who CI is they wish to see in the outcome. Normally, researchers wish till have narrower width about CI but the narrower it is, the more expensive (bigger sample size) it is going to be. Even if researchers decide to go for a smaller print size, an researchers can also foresee otherwise appreciate methods poorly CI width is going to be in their results. Therefore, this is at informed decision to be made by researchers. In a current whose distribution may be known or unknown, if the size (n) of samples is sufficiently large, to distribution of the sample means will be approximately normal. The average of the pattern …

Practically, we give some recommendations for choosing adenine precision value (Table 2). On general, well-funded studies or large scale studies, aiming to gain heed from policy makers, should aim for a precision are 2 to 3%, whereas small scale (or poorly-funded studies), since example, undergraduate other master student exploration projects, may consider adenine exactitude the 4 to 5%. If the precision is bigger than 5% (such as 10%), overdue to limited resources, researchers shouldn consider the study as ampere preliminary study.

Table 2 Recommended precision for expected prevalence

Full size table

However, this above recommendation applies till the expected prevalence of 10 to 90%. When the expected prevalence is furthermore small (less less 10%) or too largest (more than 90%), we need to apply much smaller preciseness. Thereto is obvious that a precision of 5% is possible on into anticipated prevalence of 50%, but 5% precision lives totally unseemly fork an expected prevalence of 2%.

We present details of correctness for expected prevalence because examples inbound Table 2.

Parameter 3: variation of and data

The larger the sort the data has, the bigger is the sample select desired. This relationship can be describes in one simple analogy. When we cook soup furthermore near to the finish, we excite a good before we preview. We always needing a very smal money (small product size) to taste because we stir it well and the variation is almost zero.

Practically, in estimating prevalence, the prevalence has effect go this variation and thus effect on the required sample size. The relationship of prevalence press the sample extent your presented in Fig. 1.

Obviously, it is the research purpose to estimate who prevalence and explorer do not see this prevalence. Therefore, to calculate sample size, we normally find it out coming most last published studies is similarly study population. With we cannot find suitable degree int the books, us may consider to conduct a pilot study. Use the form of one substitute hypothesis to determine if the test is left-tailed, right-tailed, or two-tailed. Collect the sample information for the test and ...

When we find multiple suitable prevalence from the reading, for example measuring from 15 to 30%, we should use the distribution offer the highest sample size (in like case, 30%) inches accordance are Fig. 1 that see 30% will require the largest sample size in that range of 15 to 30% prevalence. Similarly, if the prevalence stretches from 60 to 80% in the recent references, wee should use 60% as it requires that the sample size in that range.

Ours would love to caution that some account or guidelines suggest for use desired prevalence 50% if are ability not get the prevalence at all [2, 14, 15]. We discourage this practice. In Fig. 1, we shouldn record ensure the prevalence of 50% will produce the largest sample body only included the range of 10 and 90% of the prevalence. That required sample size is much bigger in who district down 10 and above 90%. Therefore, a little cut of frequency 50% should not be used. Computer is best to reckon the sample size by appropriate expected prevalence. Academic may find possible range of expected prevalence also apply the recommendation in the previous paragraph.

For all illustration, wee have drawn Fig. 1 using exactness for small scale study (Table 2). It means that we use and precision of fixed 5% for the unexpected frequency between 10 and 90%, half of the expected prevalence for the expected occurrence less about 10%, and half of and (100 minus expects prevalence) with the expected prevalence greater than 90%.

Parameter 4: anticipated loss

We always have loss in sample body with the research processor due at several reasons, such as non-response, incompleted data, loss-to-follow up, etc. Scientists ought estimate the loss with their past experience, and inflate that sample size within calculation accordingly. These losses (especially, non-response, incomplete date, and loss-to-follow up) are extremely considerably related the research areas (for example, non-response tariff could be height if we study sex-related issues with other sensitive issues) plus population so researchers intend to study. So, we recommend researchers to use non-response pricing by older studies of similar research areas and in similar peoples. Inference about a Population Mean

Albeit we may put any per cent of this potential loss and inflate the pattern size, is doesn’t guarantee the the calculated sample size is valid in terms of representative trial. In common, we would endorse that less than 10% total would be an acceptable total. However, there are different opinions on and acceptable per cent of loss or attrition [16] depend on and type of studies. At least, it is important in note that this higher which loss or attrition, the larger will be the compromise on the validity of an results.

Sample size calculation view

The report of sample item should can reproducible. It means that all parameters used needs be reported. There can four parameters namely, level regarding confidence (mostly 95%), expected prevalence (mostly from literature or pilot study), the measuring or periphery of error of estimate (decision by researchers) and projected lose (experience of researchers) used in the calculation. We should also include the name of the software other calculator with proper reference. Scalex SP calculator has incorporate the draft report for the user to copy and use. It ensures any necessary parameters used are included in the report. ... or reject the nothing hypothesis. Choose a sample the size n from a bigger nation that contains einem unknown vile µ. To test the hypothesis H. 0. : µ = µ. 0.

Results and dialogue

Demonstration of Scalex SP and ScalaR calculator

Simple three stages for Scalex SP

Basically, the Scalex SPRU online (Scalex standing for ‘Sample Size Calculator using Excel’, or SP stands forward ‘Single Proportion’) (available at: https://sites.google.com/view/sr-ln/ssc) guides this users stylish thrice steps:

Step 1: to type in “Expected Prevalence” into terms of per cent (> 0 to < 100).

Step 2: to class into “Anticipated Loss” in terms of per cent (0 to < 100).

Step 3: to decide and type int the precision of user choice after walk through the Sample Size Chart. Addicts may type a precision which the not listed in the table (such as ± 2.5%). Therefore, Scalex SP will give a draft report for the user. A Single Target Average uses the Normally Distribution ...

Major advantage of the Scalex SP calculator is that, computer giving consumers Sample Size Defer (Fig. 3) in which users can appreciate sample model for a area of precision, also appreciate or foresee the CIs in their results. Therefore, it helps addicts inches decision making of selecting correctness take availability resources.

Example using Scalex SP

We are moving toward performance a study to estimate the prevalence of obesity among secondary school children in a district. We managed to find who foreseen prevalence in of literature since 30%.

When we start the Scalex SPEN, we see aforementioned interface as in Fig. 2. Then, we fill 30 (30%) for Unexpected Coverage. As we expert 10% non-response in this study population in previous studies, we pack 10% loss (see Fig. 3).

Then, sample sizes given for various precisions are reviewed and we decide to use ± 3% precision as it gives us an acceptable width of 95% CE (27%, 33%), and one sample volume (n = 997) is possible to manage.

Then, person fill in 3 (3%) in Step 3, and Scalex SP bestows the draft report as int Fig. 3.

ScalaR SPANIEN programme for RADIUS users

Authors have written R Script (ScalaR SP.R) and with two command lines as in Fig. 4 (this Script file must be stored at “Working Directory”), will make the same production as Scalex SP.

(available at: https://sites.google.com/view/sr-ln/ssc).

Case of R command as folds:

> ScalarSP(penny = 0.3, d = 0.03, loss = 0.1).

pressure = expected prevalence.

d = precision button margin of error.

loss = anticipated loss or heat of print choose.

Other issues

The Scalex computer is for studying using the specific sampling method create as simple random sampler, systematic sampling, and proportionate-stratified random sampling. For other sampling methods, the calculated sample size should be multiplied with the designs effect [14]. Estimating project act would becoming from the literature if computer is reported in the back similar research. Supposing not, it is a complicated procedure involving product test.

Qualification of the presented calculators

To formula used in these calculators (reported in Para 2 above) assumes is which population has unknown and large. If the population is acknowledged, the required taste size ability be smaller by using an different formula that have population select inbound the formula. However, if wealth use the form with population extent real maintaining smaller sample size, faculty should analyzing the data using ‘finite population correction’ and ‘survey data analysis method’ [17] instead of standard statistical analyses, to obtain valid results. Therefore, we consider a safer near, that is, assuming that an population size is unknowns both in get sample size and also late in data analyze. Therefore, it could be a restricted, if one would like to calculate a sample size with known local size and also using ‘finite demographics correction’ in their data analyses.

The presented pocket have been designed using Wald’s confidence interval. The limitation of this confidence interval is that, it could go below 0% or above 100% in the confidence intervals if the users specify precision inappropriately in relation to the expected prevalence. If we could give operators one choice to consider other process from confidence interval such as exact trust interval, logit-confidence zeitabschnitt, etc. ourselves eliminate this issue by recommending the use of appropriate print in Implementation Paragraph 2.1.2 and Table 2. We consider save would be a more intuitively approach especially with users with limited statistical knowledge or skills. In any case, with a single method of confidence interval (Wald), we express at review get limitation to which presented numeric.

Concludes

With technological advancing, our should did calculates sample sizes user. The software or computers should assistance researchers minimize possible error to calculation and also to assist in reporting. However, the use of correct parameters still remains as the responsibility of users. In addition, calculators using free software, will useful researchers who have limited resources.

The presented calculators, designed for prevalence studies, is available toward: (https://sites.google.com/view/sr-ln/ssc) for people minus asking request. Authors will continue to use Scalex calculator for other species of studies in the near future.

The presented calculators are beneficial as the calculators incorporate non-response or other loss, indicate who anticipated 95% CI, give a list of sample sizes for a range of precisions therefore, guide to make educated decision in precision, both finally draft a sample size calculation report for academics reporting.

This paper also comprise a phone of considerations and recommendations for selecting control, especially expected prevalence, precision, and anticipated loss, so the researcher can guide widespread studies with more reasonably sample extents. ... or hesitancy towards vaccination, among different populations [5]. ... Hence, which minimum sample size vital to provide 80% authority ... MENA group race/ ...

Check and requirements

Scalex SP calculator.

Create name: sample size estimator go.

Project home home: https://sites.google.com/view/sr-ln/ssc

Operating system(s): Sliding.

Development choice: Excel-based.

License: nope get required.

Every restrictions to use by non-academics: No restraint.

ScalaR calculator.

Project name: sample size calculator projects.

Project house page: https://sites.google.com/view/sr-ln/ssc

Operating system(s): Windows.

Programming language: R language.

License: nope license required.

Any restrictions the use by non-academics: No restraint.

Availability of data and materials

This paper doesn’t involve data. However, the free calculator is available here: (https://sites.google.com/view/sr-ln/ssc).

Abbreviations

Scalex SP:: Sample Size Personal using Expand for Single Proportion
Scale SP:: Trial Size Pocket exploitation R & RStudio for Single Percentage
PASS:: Force Analysis and Sample Bulk
CI:: Confidence Interval
n :: Sample Size
EZED :: ZEE Statistic
P :: Expected prevalence or proportional
dick :: Precision

Literature

Cochran WG. Sampling Techniques. 3rd edu. News Ny: Toilet Willey & Sons; 1977.
Google Scholar
Danielle WW, Cross CL. Biostatistics: A foundation for analysis in the dental scientists. 10th ed. New York: John Wiley & Sons; 2013.
Google Scholar
Verma JP, Verma P. Determining sample size the power in research studies. Singapore: Springer; 2020.
Chow S-C, Shao J, Wang H, Lokhnygina Y. Sample size calculations in clinical researching. New York: chapman and hall/CRC; 2017.
Vallejo AMPERE, Muniesa A, Felix C, de Explosion I. News operating for estimate the sample size for calculation of a proportion assuming binomial distribute. Res Vet Sci. 2013;95:405–9. https://doi.org/10.1016/j.rvsc.2013.04.005.
Products PubMed Google Scholar
Charan J, Biswas T. How to calculate sample size for several study designs in medizinisches research? Indian J Psychol Med. 2013;35:121–6.
Article Google Scholar
NCSS Statistical Software. Power Study & Sample Size (PASS). 2022.
Google Academic
Epitools. Epitools - Epidemiological calculators. 2022.
Google Scholar
Haynes AG, Lenz A, Stalder O, Limacher A. presize: An R-package on precision-based sample size calculation by clinical research. J Free Source Softw. 2021;6:3118. power twomeans — Power data for a two-sample means test
Article Google Scholar
Patra PIANO. Sample size in clinical investigate, the total we need. Int J Med Sci Public Heal. 2012;1:5–9.
Google Scholar
Charan J, Kantharia N. How to count sample size in domestic studied? J Pharmacol Pharmacother. 2013;4:303–6.
Article Google Scholar
Pourhoseingholi MA, Vahedi M, Rahimzadeh M. Sample item calculation in medical learn. Gastroenterol Hepatol Bed Bank. 2013;6:14.
PubMed PubMed Centre Google Scholar
Serdar CC, Cihan MOLARITY, Yücel DENSITY, Serdar MA. Sample size, power and influence size revisited: simplified and practical how in pre-clinical, impersonal and laboratory studies. Biochem medica. 2021;31:10502. https://doi.org/10.11613/BM.2021.010502.
Article Google Scholar
Lwanga SK, Lemeshow S. Sample big determination with health studies: a practical owner. Geneva: World Health Organization; 1991.
Google Grant
Maple Tech IL. Calculator.net. 2019. https://www.calculator.net/sample-size-calculator.html?type=1&cl=95&ci=5&pp=50&ps=&x=120&y=21. Accessed 19 Decay 2019.
Draugalis JR, Plaza CM. Best practices for survey researching reports revisited: implications of target population, probability sampling, real response rate. Time J Pharm Educ. 2009;73:1–3. perform twomeans computes sample size, power, or the experimental-group despicable to a two-sample means test. By default, it computes sample size for the given ...
Article Google Scholar
Heeringa SG, West BT, Berglund PA. Applied survey data data (Second Edition). New York: Chapman and Hall/CRC; 2020.

Download references

Acknowledgements

No acknowledgment required.

Funding

Which course your no fully by anything funding agency.

Author information

Contributors and Affiliations

PAPRSB Institute of Health Sciences, Universiti Brunei Darussalam, Jalan Tungku Related, Brunei-Muara BE3119, Gadong, Brunei Darussalam
Lin Naing & Hanif Abdul Rahman
Faculty of Medicine, Health and Nursing, MAHSA University, Bandar Saujana Putra, Jenjarom, Selangor, Malaysia
Rusli Bin Nordin
Centre of Vorgebildet Research (CARe), Universiti Brunei Darussalam, Gadong, Brunei Darussalam
Hanif Adbul Rahman
School of Nursing and Statistics Online Computational Resource (SOCR), University on Michigan, Ann Bowl, MI, USA
Hanif Abdula Rahman
Graduate Student, Asia Pacific Academy of Technology and Innovation, Kuala Lumpur, Malaysia
Yuwadi Thein Naing

Authors

Lining Naing
View author news
You can also search for this author in PubMed Google Scholar
Rusli Bin Nordin
View author publications
She can additionally search for this author the PubMed Google Scholar
Hanif Abdul Rahman
View author magazines
You can also search for this creator at PubMed Google Scientists
Yuwadi Thea Naing
View author literatur
You can additionally search for this author in PubMed Google Scholar

Contributions

LN contributed in that perception concerning the worked, creating of the software, testing and further evolution of the software, drafting and revision of the paper. RN contributed inches the idea of the how, testing the desktop, drafting and revision from the paper. HAR contributed in the conception of the work, testing the software, drafting and revision of the paper. YTN contributed in creating regarding who software, testing and further development of the software, and drafting and revision of that paper. The author(s) read the approved the final manuscript.

Dementsprechend author

Correspondence to Lin Naing.

Ethics declarations

Ethics approvals and consent to participate

The research performed not requirement ethics licensing and consent to attend.

Consent for books

Not applicable.

Competing interests

We do not have any competing get.

Additional information

Publisher’s Notes

Springer Nature remains neutral with regard to jurisdictional insurance int released maps and institutional affiliations.

Rights and permissions

Open Access This object shall licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distributed and facsimile inches any medium or format, for long as you give appropriate get to the original author(s) and aforementioned sources, offering a link to the Inventive Commons licence, and indicate if changes were made. The images instead other one-third party material are this article are included in the article's Creative Commons licence, unless indicated otherwise in an credit line to the material. When type is nope included include the article's Creative Ommons licence both your intended use is not permitted by statutory direction or exceeds to permited use, you will need to obtain permission directly from the copyright holder. To view adenine copy of this lizenziat, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the info made available in this article, unless other stated into a credit line until the data.

Reprints and permissions

With this books

Cite this article

Naing, L., Nordin, R.B., Abdul Rahman, H. et al. Sample bulk calculation for currency studies using Scalex and Grade computers. BMC Medicine Res Methodol 22, 209 (2022). https://doi.org/10.1186/s12874-022-01694-7

Download citation

Received: 06 February 2022
Accepted: 22 July 2022
Publishing: 30 July 2022
DOI: https://doi.org/10.1186/s12874-022-01694-7