Sample size for evaluation the of multicollinearity degree in productive traits of rye
DOI:
https://doi.org/10.5902/2179460X41667Keywords:
Secale cereale L, Sampling design, Correlation, Multivariate analysisAbstract
The objectives of this work were to determine the sample size (number of plants) necessary to estimate the indicators of the of multicollinearity degree - condition number (CN), determinant of the correlation matrix (DET), and variance inflation factor (VIF) - in productive traits of rye and to verify the variability of the sample size between the indicators. Five and three uniformity trials were conducted with the cultivars BRS Progresso and Temprano, respectively, and seven productive traits were evaluated in 780 plants. Twenty-one cases were obtained from seven traits, combined five to five. In each case, 197 sample sizes were planned (20, 25, 30, ..., 1,000 plants) and in each size 2,000 resampling were performed, with replacement. For each resample the CN, DET and FIV were determined and the average among 2,000 estimates of each indicator of the multicollinearity degree was calculated. Then, for each case and indicator, the sample size was determined through three models: models of maximum modified curvature, segmented linear with plateau response, and segmented quadratic with plateau response. There was superiority the quadratic model segmented with plateau in adjusting the degree of multicollinearity according to the sample size for all indicators. There is a need greater sample size to detect multicollinearity when diagnosed by DET and for sizes larger than 101, 258 and 102 plants when diagnosing for the number of conditions, determinant and inflation factor performed, respectively.
Downloads
References
ALVARES CA, STAPE JL, SENTELHAS PC, de MORAES GONÇALVES JL, SPAROVEK G. Köppen’s climate classification map for Brazil. Meteorol. Zeitschrift. 2013;22(6):711-28.
ALVES BM, CARGNELUTTI FILHO A, BURIN C. Multicollinearity in canonical correlation analysis in maize. Genet. Mol. Res. 2017;16(1):1-14. https://doi.org/10.4238/gmr16019546.
ALVES BM, CARGNELUTTI FILHO A, BURIN C, TOEBE M. Correlações canônicas entre caracteres agronômicos e nutricionais proteicos e energéticos em genótipos de milho. Rev. Bras. Milho e Sorgo. 2016;15(2):171-85. https://doi.org/10.18512/1980-6477/rbms.v15n2p171-185.
AZEVEDO AM, SEUS R, GOMES CL, FREITAS EM de, CANDIDO DM, SILVA DJH da, et al. Correlações genotípicas e análise de trilha em famílias de meios-irmãos de couve de folhas. Pesqui. Agropecuária Bras. 2016;51(1):35-44. https://doi.org/10.1590/S0100-204X2016000100005.
BAIER AC. Centeio. Passo Fundo - RS: EMBRAPA Trigo; 1994.
http://ainfo.cnptia.embrapa.br/digital/bitstream/item/164511/1/FL-06193.pdf.
CARGNELUTTI FILHO A, ALVES BM, SANTOS GO, WARTHA CA, KLEINPAUL JA, SILVEIRA DL. Sample size to estimate the mean and median of traits in flax. Brazilian J. Agric. Sci. 2018a;13(1):1-7. https://doi.org/10.5039/agraria.v13i1a5492.
CARGNELUTTI FILHO A, ALVES BM, TOEBE M, BURIN C, WARTHA CA. Sample size to estimate the mean of traits in jack bean. Brazilian J. Agric. Sci. 2018b;13(1):1-7.
https://doi.org/10.5039/agraria.v13i1a5505.
CARGNELUTTI FILHO A, WARTHA CA, KLEINPAUL JA, NEU IMM, SILVEIRA DL. Sample size to estimate the mean and median of traits in canola. J. Agric. Sci. 2018c;10(11):123-37. https://doi.org/10.5539/jas.v10n11p123.
FIELD A. Descobrindo a estatística utilizando o SPSS. 2 ed. Porto Alegre: Artmed; 2009.
FARAWAY J. faraway: Functions and Datasets for Books by Julian Faraway. R package version 1.0.7, 2016. https://CRAN.R-project.org/package=faraway.
FOLLMANN DN, CARGNELUTTI FILHO A, SANTOS MS dos, COSTA VO, PLAUTZ ÉN, SCOPEL JVF, et al. Correlations and path analysis in sunflower grown at lower elevations. J. Agric. Sci. 2019;11(2):445-53. https://doi.org/10.5539/jas.v11n2p445.
GUJARATI DN, PORTER DC. Econometria básica. 5 ed. Porto Alegre: AMGH Editora Ltda; 2011.
HAIR JF, BLANCK WC, BABIN BJ, ANDERSON RE, TATHAN RL. Análise multivariada de dados. 6 ed. Porto Alegre: Bookman; 2009.
MEIER VD, LESSMAN KJ. Estimation of optimum field plot shape and size for testing yield in Crambe abyssinica Hochst. Crop Science. 1971;11(5):648-650.
MONTGOMERY DC, PECK EA. Introduction to linear regression analysis. New York: John Wiley and Sons; 1982.
NOURAEIN M. Elucidating seed yield and components in rye (Secale cereale L.) using path and correlation analyses. Genet. Resour. Crop Evol. 2019;66(7), 1533–1542.
https://doi.org/10.1007/s10722-019-00813-6.
OLIVOTO T, NARDINO M, CARVALHO IR, FOLLMANN DN, FERRARI M, PELEGRIN AJ de, et al. Optimal sample size and data arrangement method in estimating correlation matrices with lesser collinearity: A statistical focus in maize breeding. African J. Agric. Res. 2017;12(2):93-103. https://doi.org/10.5897/AJAR2016.11799.
R TEAM CORE. R: A language and environment for statistical computing. Vienna: R Foundation for Statistical Computing; 2018. https://www.r-project.org.
SANTOS HG dos, JACOMINE PKT, ANJOS LHC dos, OLIVEIRA VÁ de, LUMBRERAS JF, COELHO MR, et al. Sistema brasileiro de classificação de solos. 5 ed. Brasília: Embrapa Solos; 2018.
SARI BG, LÚCIO AD, OLIVOTO T, KRYSCZUN DK, TISCHLER AL, DREBES L. Interference of sample size on multicollinearity diagnosis in path analysis. Pesqui. Agropecuária Bras. 2018;53(6):769-73. https://doi.org/10.1590/s0100-204x2018000600014.
TOEBE M, CARGNELUTTI FILHO A. Não normalidade multivariada e multicolinearidade na análise de trilha em milho. Pesqui. Agropecuária Bras. 2013;466-77. https://doi.org/10.1590/S0100-204X2013000500002.
TOEBE M, BANDEIRA CT, FORTES SKG, CARVALHO JO de, TARTAGLIA F de L, TAMBARA AL, et al. Dimensionamento amostral e associação linear entre caracteres de Crotalaria spectabilis. Bragantia. 2017a;76(1):45-53. https://doi.org/10.1590/1678-4499.653.
TOEBE M, CARGNELUTTI FILHO A, STORCK L, LÚCIO AD. Sample size for estimation of direct effects in path analysis of corn. Genet. Mol. Res. 2017b;16(2):1-23. https://doi.org/10.4238/gmr16029523.
TOEBE M, CARGNELUTTI FILHO A, STORCK L, LÚCIO, AD. Direct effects on scenarios and types of path analyses in corn hybrids. Genet. Mol. Res. 2017c;16(1);1-15.//doi.org/10.4238/gmr16019529.
TOEBE M, MACHADO LN, TARTAGLIA FL, de CARVALHO JO, BANDEIRA CT, CARGNELUTTI FILHO A. Sample size for estimating mean and coefficient of variation in species of crotalarias. An. Acad. Bras. Cienc. 2018;90(2):1705-15. https://doi.org/10.1590/0001-3765201820170813.
WEI T, SIMKO, V. R package "corrplot": Visualization of a Correlation Matrix. Version 0.84, 2017. https://github.com/taiyun/corrplot.
Published
Versions
- 2022-01-14 (2)
- 2021-03-10 (1)
How to Cite
Issue
Section
License
Copyright (c) 2021 Ciência e Natura
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
To access the DECLARATION AND TRANSFER OF COPYRIGHT AUTHOR’S DECLARATION AND COPYRIGHT LICENSE click here.
Ethical Guidelines for Journal Publication
The Ciência e Natura journal is committed to ensuring ethics in publication and quality of articles.
Conformance to standards of ethical behavior is therefore expected of all parties involved: Authors, Editors, Reviewers, and the Publisher.
In particular,
Authors: Authors should present an objective discussion of the significance of research work as well as sufficient detail and references to permit others to replicate the experiments. Fraudulent or knowingly inaccurate statements constitute unethical behavior and are unacceptable. Review Articles should also be objective, comprehensive, and accurate accounts of the state of the art. The Authors should ensure that their work is entirely original works, and if the work and/or words of others have been used, this has been appropriately acknowledged. Plagiarism in all its forms constitutes unethical publishing behavior and is unacceptable. Submitting the same manuscript to more than one journal concurrently constitutes unethical publishing behavior and is unacceptable. Authors should not submit articles describing essentially the same research to more than one journal. The corresponding Author should ensure that there is a full consensus of all Co-authors in approving the final version of the paper and its submission for publication.
Editors: Editors should evaluate manuscripts exclusively on the basis of their academic merit. An Editor must not use unpublished information in the editor's own research without the express written consent of the Author. Editors should take reasonable responsive measures when ethical complaints have been presented concerning a submitted manuscript or published paper.
Reviewers: Any manuscripts received for review must be treated as confidential documents. Privileged information or ideas obtained through peer review must be kept confidential and not used for personal advantage. Reviewers should be conducted objectively, and observations should be formulated clearly with supporting arguments, so that Authors can use them for improving the paper. Any selected Reviewer who feels unqualified to review the research reported in a manuscript or knows that its prompt review will be impossible should notify the Editor and excuse himself from the review process. Reviewers should not consider manuscripts in which they have conflicts of interest resulting from competitive, collaborative, or other relationships or connections with any of the authors, companies, or institutions connected to the papers.