4 months ago
Monday, February 2, 2009
Enhanced R for Linux Users
I have recently been using Rkward, an opensource graphical user interface for R that runs on Linux. Windows users can only hope that this will be released in a Windows format; as this is probably the most sophisticated, yet user friendly, R graphical user interface available. This is particularly true with the latest version. Despite access to point-and-click options for some of the basic commands; the primary utility is the script input and output editor/viewer which are great for storing and organizing syntax for projects. The only program I can think of that compares to this is EMACS, which has a steep learning curve. Want to try this? First-- have a machine running Linux. I currently run opensuse 11.1 on one of my computers and would recommend this OS for users looking to switch from Windows to Linux. If you have linux, just use the link above for download-- or go to the package manager in your distribution.
Monday, December 29, 2008
Chapter 2, Exercise 6, pg 33-34 (basic bivariate regression)
Exercise 6 has been posted on the companion web page. This exercise covers bivariate regression in the R statistical computing program. First a brief example of how to manually calculate regression coefficients using formulas 2.4.3 and 2.4.4 is provided. Then, the linear model function lm() is used to specify a basic regression model. The summary() function is then used to show model results. This exercise uses the data set C0206DT from the book's CD.
Advanced Structural Equation Modeling modeling in R
This Summer the researchers Boker, Neal, and Maes from the Unviersity of Virginia will be releasing the first beta test of their Structural Equation Modeling (SEM) software, OpenMX. OpenMX is a port and upgrade of the late statistical modeling program MX which will now be integrated into the R statistical computing environment. This program will provide a large increase in the accessibility, efficiency, and utility of the MX software. The OpenMX team promises extensive documentation so that students having only entry level knowledge of SEM will be able to learn and use this software.
Advanced options and estimators ranging from multi-level modeling, nuero imaging, dynamical systems, and mixed modeling to Full Information Maximum Likelihood estimation will be available in OpenMx. This software will also provide a graphical user interface in which models can be specified by path diagrams. This will be a massive improvement on the availability of structural equation modeling software in R.
Closed beta starts in January (2009) and open beta will begin this summer (2009).
See documentation here
Website here
Advanced options and estimators ranging from multi-level modeling, nuero imaging, dynamical systems, and mixed modeling to Full Information Maximum Likelihood estimation will be available in OpenMx. This software will also provide a graphical user interface in which models can be specified by path diagrams. This will be a massive improvement on the availability of structural equation modeling software in R.
Closed beta starts in January (2009) and open beta will begin this summer (2009).
See documentation here
Website here
Saturday, December 20, 2008
Chapter 2, Exercise 5, pg 45,49, 54,55 (correlation continued)
Posted on the companion web page is exercise 5, which covers how to obtain confidence intervals, and tests of significance for the basic bivariate correlation, the Spearman rank order correlation, the point biserial correlation, and the phi coefficient using the cor.test function in the R statistical computing program.
This covers pages 45 and 49 in the book and expands the exposition past basic bivariate correlation.
The second part of this exercise covers polychoric and polyserial correlation which is described on page 54 and 55 in the text. Specifically the book covers biserial correlation (not to be confused with point biserial correlation) and tetrachoric correlation. Both of these correlations concern variables (either one or two) that are artificially dichotomous. An example of an artificially dichotomous variable could be a variable that concerns a population that is best described on a continuum, yet is categorical in a data set (e.g. depressed, not depressed). The R package called polycor is used for these coefficients.
This covers pages 45 and 49 in the book and expands the exposition past basic bivariate correlation.
The second part of this exercise covers polychoric and polyserial correlation which is described on page 54 and 55 in the text. Specifically the book covers biserial correlation (not to be confused with point biserial correlation) and tetrachoric correlation. Both of these correlations concern variables (either one or two) that are artificially dichotomous. An example of an artificially dichotomous variable could be a variable that concerns a population that is best described on a continuum, yet is categorical in a data set (e.g. depressed, not depressed). The R package called polycor is used for these coefficients.
Friday, December 12, 2008
Some Useful Links for Psychological Research In R
I have posted some useful links on the bottom right hand side of the blog. All of these links concern R applications for psychology. You will find R syntax, packages, and other resources related to psychological research at each of these pages. below are the links and a brief description:
http://www.statmethods.net/index.html :This site contains a wealth of information on R. This is an excellent site to bookmark for R questions. This covers everything from data manipulation and basic statistics to advanced statistics and advanced graphical applications.
http://www.personality-project.org/r/ :this site covers the psych package for R. This package provides some basic statistical applications of R that are pertinent to the psychological sciences.
http://cran.r-project.org/web/packages/QuantPsyc/QuantPsyc.pdf :this is the manual for the QuantPsyc R package which contains several functions that aid in investigating statistical mediation, moderation, and multivariate assumptions.It also has a useful application of the boot command for bootstrapping parameters of basic mediation models.
http://cran.r-project.org/doc/manuals/R-intro.pdf :this is the manual for statistical applications in R. It can be rather confusing, yet will make navigating R much easier if it is read.
http://cran.r-project.org/doc/contrib/Baron-rpsych.pdf :this is a manual for conducting research with survey data. It covers the basics of manipulating and investigating such data (making composites, finding various reliabilities, etc.).
http://gking.harvard.edu/amelia/ :This is arguably the most user friendly and sophisticated multiple imputation program for R. It can be ran in or outside of R. This program imputes data under the assumption of Missing at Random, and uses a bootstrapping based EM algorithm. This is a great program to quickly and easily deal with missing data. It is recommended that the user read the manual before use.
http://www.statmethods.net/index.html :This site contains a wealth of information on R. This is an excellent site to bookmark for R questions. This covers everything from data manipulation and basic statistics to advanced statistics and advanced graphical applications.
http://www.personality-project.org/r/ :this site covers the psych package for R. This package provides some basic statistical applications of R that are pertinent to the psychological sciences.
http://cran.r-project.org/web/packages/QuantPsyc/QuantPsyc.pdf :this is the manual for the QuantPsyc R package which contains several functions that aid in investigating statistical mediation, moderation, and multivariate assumptions.It also has a useful application of the boot command for bootstrapping parameters of basic mediation models.
http://cran.r-project.org/doc/manuals/R-intro.pdf :this is the manual for statistical applications in R. It can be rather confusing, yet will make navigating R much easier if it is read.
http://cran.r-project.org/doc/contrib/Baron-rpsych.pdf :this is a manual for conducting research with survey data. It covers the basics of manipulating and investigating such data (making composites, finding various reliabilities, etc.).
http://gking.harvard.edu/amelia/ :This is arguably the most user friendly and sophisticated multiple imputation program for R. It can be ran in or outside of R. This program imputes data under the assumption of Missing at Random, and uses a bootstrapping based EM algorithm. This is a great program to quickly and easily deal with missing data. It is recommended that the user read the manual before use.
Thursday, December 11, 2008
Chapter 2, Exercise 4; pg 31: table 2.3.2 (CCAW)
Posted on the companion web page is the 4th exercise for chapter 2, which shows how to create the phi correlation coefficient (i.e. correlation between two dichotomous variables) and the spearman rank order correlation (i.e. correlation between data that are rank ordered). Use data sets C0204dt.txt and C0204DT.txt from the Cohen Cohen Aiken West text.
Chapter 2, Exercise 3; pg 30 table 2.3.1
The 3rd exercise has been posted on the the companion web page. This exercise covers how to create the point biserial correlation coefficient in the R statistical computing program.
This exercise is the R counterpart to exercise CH02EX02 in the book;however it is the third exercise on this blog. The next post will cover how to obtain the phi-coefficient and the rank correlation. Use the data set C02030DT.txt from your book's CD.
This exercise is the R counterpart to exercise CH02EX02 in the book;however it is the third exercise on this blog. The next post will cover how to obtain the phi-coefficient and the rank correlation. Use the data set C02030DT.txt from your book's CD.
Subscribe to:
Comments (Atom)