We have received over 2000 responses from an online survey we ran.

I am interested to see if there is a relationship between two questions, i.e. time spent performing a certain task and the number of times athletes drank water when performing the task. There were five response options for both.

My question to the forum is: what statistical test (in SPSS) could we use to answer this question?

https://ibb.co/5WDvNHK

There are two groups; I'm trying to figure out and justify the equation.

Mean for x1: 49.974

Mean for x2: 45.02

st. dev 1: .25

st. dev 2: .46

Is my math tracking? Feel like I'm missing something. Shout out to anyone willing to help on a Friday night.
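The post does not say which equation is meant or how large each group is, so the following is only a minimal sketch of two common ways to compare two group means from summary statistics. The function names are hypothetical, the effect-size helper assumes equal group sizes, and the sample sizes passed to the Welch statistic must be supplied by the reader because they are not given above.

```python
import math


def cohens_d(mean1, mean2, sd1, sd2):
    """Standardized mean difference.

    Assumption: equal group sizes, so the pooled SD is the
    root-mean-square of the two standard deviations.
    """
    pooled_sd = math.sqrt((sd1 ** 2 + sd2 ** 2) / 2)
    return (mean1 - mean2) / pooled_sd


def welch_t(mean1, mean2, sd1, sd2, n1, n2):
    """Welch's t statistic (does not assume equal variances).

    n1 and n2 are NOT given in the post; they are required inputs here.
    """
    standard_error = math.sqrt(sd1 ** 2 / n1 + sd2 ** 2 / n2)
    return (mean1 - mean2) / standard_error


# Summary statistics quoted in the post.
d = cohens_d(49.974, 45.02, 0.25, 0.46)
```

With means this far apart relative to the standard deviations, the standardized difference is very large, so it is worth double-checking that the standard deviations and means are on the same scale.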

How could you solve...

How would you test this

I have some data, and I am not sure which statistical test I should use. The data comes from a survey.

In the survey, the respondents (dentists) were presented with two cases that differed slightly and were asked how they would handle each hypothetical case: would they perform surgery or not, based on the patient's condition? (The answers were categorical (5 in total), but I have transformed them into numbers ranging 0-1, because three of the answers meant no surgery (0) and two...

Help - which test should I use?

The probability distribution below is representative of x, the number of cars sold within a single day.

x = 0, 1, 2, 3, 4

P(x) = 0.1, 0.4, 0.3, 0.2

I calculated the standard deviation and mean, which I believe are 0.917 and 1.6 respectively.

The part I am struggling with is:

Finding the probability of two randomly selected days having sold less than 3 cars.

Selecting 100 random days, what is the probability of at most 50%...

Probability Distribution of Cars Sold
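As a check on the quoted mean and standard deviation, here is a minimal sketch. It assumes the four listed probabilities belong to x = 0, 1, 2, 3 (only four probabilities are given for five x values), that sales on different days are independent, and that "two days having sold less than 3 cars" means each of the two days had fewer than 3 sales.

```python
import math

# Assumption: the four listed probabilities pair with x = 0, 1, 2, 3.
pmf = {0: 0.1, 1: 0.4, 2: 0.3, 3: 0.2}

# Mean and standard deviation of a discrete distribution.
mean = sum(x * p for x, p in pmf.items())
variance = sum((x - mean) ** 2 * p for x, p in pmf.items())
sd = math.sqrt(variance)

# Probability that a single day has fewer than 3 cars sold.
p_less_than_3 = sum(p for x, p in pmf.items() if x < 3)

# Assumption: the two days are independent, so the probabilities multiply.
p_both_days = p_less_than_3 ** 2
```

Under these assumptions the mean and standard deviation agree with the values quoted in the post, and the two-day probability is the square of the single-day probability.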

"The censoring nature of earning information in national surveys and the administrative case reporting system should be taken into consideration in developing empirical specifications of wages. For example, in RSA-911 data, clients’ earnings after receipt of services are only discernible for those clients whose cases were successfully closed with the code “Exited with Employment Outcome.” Unfortunately, individuals who exited the VR system without any employment...

Censored data

I'm currently working on my thesis, which is an analysis of the effects of social cycling on cyclists' stress levels.

An experiment with a total of 21 participants was conducted. The experiment compared single cycling with group cycling.

Single rides and group rides were performed while measuring the GSR (stress) data, in order to notice if there are any effects or changes on stress when riding in a group compared to riding alone.

Every participant has to...

two way ANOVA?

I am currently studying for university exams and stumbled upon a theoretical question while my mind was wandering (the question having nothing to do with my studies as I am in the accounting department).

It has to do with multiple choice question (MCQ) tests. I wanted to figure out the probability of achieving a certain score on an MCQ test if you have no idea which answers are correct and thus answer the test blindly.

Suppose that the test consists of

Results on blind MCQ test
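The details of the test are cut off above, so this is only a sketch under stated assumptions: every question has the same number of options, exactly one option is correct, each guess is independent and uniform, and there is no negative marking. Under those assumptions the score follows a binomial distribution. The function names and parameters are hypothetical.

```python
from math import comb


def prob_exact_score(n_questions, n_options, score):
    """P(exactly `score` correct) when every answer is a blind guess."""
    p = 1 / n_options  # chance of guessing one question correctly
    return comb(n_questions, score) * p ** score * (1 - p) ** (n_questions - score)


def prob_at_least(n_questions, n_options, score):
    """P(at least `score` correct) under the same assumptions."""
    return sum(
        prob_exact_score(n_questions, n_options, k)
        for k in range(score, n_questions + 1)
    )
```

For example, `prob_at_least(20, 4, 10)` would give the chance of getting at least half of 20 four-option questions right by guessing alone.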

List 1:

Code:

`[2, 5, 6, 8, 13, 14, 15, 18, 19, 20, 22, 24, 32, 34, 35, 39, 42, 43, 45, 46, 47, 50, 52, 53, 54, 55, 56, 57, 58, 60, 62, 66, 67, 71, 72, 73, 74, 76, 79, 80]`

Code:

`[[1, 2, 72], [30, 76, 79], [9, 18, 74], [24, 28, 55], [2, 16, 49], [8, 64, 73], [29, 38, 58], [48, 56, 69], [7, 12, 14], [17, 34, 76], [16, 23, 46], [35, 60, 70], [33, 66, 80], [36, 59, 66], [68, 70, 75], [28, 60, 71]...`

predicting with interval data with a limited number of levels.

I would like to know how one can test the linearity assumption in a multiple regression model.

I know that for a simple regression, this assumption is tested visually by examining the scatter plot between X = predictor Y = outcome, with the dots forming an ellipse and being aligned on the regression line.

However, I am not sure how to do this in a multiple regression model. Should the residuals vs. fitted curve have a specific shape?

Anyone ever come across literature like this?

I was wondering if anyone could shed some light on a current situation I have encountered. I have a study set-up with 13 participants that I am examining over 5 time-points, hence, I intend to conduct a one-way ANOVA. However, on one of the time-points, a participant was unable to attend the situation so I now have a missing case but given there only being 13 participants I am reluctant to remove the participant from the data set. Could anyone please advise what to do in this...

ANOVA with missing case

I'm fairly new to R and have been working in psychometrics for the past year or so.

I just had a quick question about using the Pearson product moment correlation.

I've developed a questionnaire that attempts to answer the question: can we evaluate soft skills based on online activity? One of those skills, for example, is communication. That variable is assigned a value of 0 (poor communication) to 5 (excellent communication), so I'm really dealing with ordinal data. What I'm...

Assessing Concurrent Validity

I am working on a study investigating emotional dysregulation in healthy volunteers. My aim is to give a screening questionnaire, the DERS (Difficulties in Emotional Regulation Scale), to a sufficient number of people and afterwards categorise the subjects into two extreme groups: (1) high emotional dysregulation and (2) low emotional dysregulation.

How should I calculate my sample size? How many people should I include in my experiments, after the screening? And how many...

sample size

Study 1:

STUDY 3:

What Statistical Tests should I run

https://www.surveymonkey.com/mp/sample-size-calculator

I filled it in like this for a survey of Italians' habits:

Population size (Italian): 60,000,000

Confidence level: 95%

Margin of error: 5%

The result is 385, but I would like to divide it into males and females and by 10 age groups each.

If I do 385/20 = 19.25 is that correct?

Then administer the questionnaire to 19.25 males between 20 and 25...

Calculate the sample number by age and gender
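The figure of 385 can be reproduced with the standard sample-size formula for a proportion, shown here as a rough sketch. It assumes the calculator uses z = 1.96 for 95% confidence, the most conservative proportion of 0.5, and a finite population correction; the calculator's exact method is not stated in the post, and the function names are hypothetical.

```python
import math


def sample_size(population, z=1.96, margin=0.05, proportion=0.5):
    """Required sample size for estimating a proportion.

    Assumptions: z = 1.96 (95% confidence) and proportion = 0.5
    (worst case), with a finite population correction.
    """
    n0 = z ** 2 * proportion * (1 - proportion) / margin ** 2
    n = n0 / (1 + (n0 - 1) / population)
    return math.ceil(n)


def margin_of_error(n, z=1.96, proportion=0.5):
    """Margin of error obtained from a simple random sample of size n."""
    return z * math.sqrt(proportion * (1 - proportion) / n)


total = sample_size(60_000_000)  # 385, matching the post
per_cell = total / 20            # 19.25 respondents per gender/age cell
cell_margin = margin_of_error(19)
```

The point of `cell_margin` is that 385 gives a 5% margin of error only for the sample as a whole. Splitting it into 20 cells of about 19 people leaves each cell with a much wider margin (above 20 percentage points under these assumptions), so per-group estimates would need a larger sample in every cell.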

What statistical test should I run Study 3

60 participants (within those, 20 university participants) Participants randomly assigned to one of three images (human, humanoid, and monster).

3 Conditions: Between-subjects design = to increase number of 36 images per condition (108 images altogether)

Human

Humanoid

Monster

What Statistical analysis should I run Study 1

I produced some data during an experiment but I am not sure which statistical test I should use. I hope that you could help me with this.

Overall, I expose a defined number of cells to different drug formulations in order to assay their toxicity (it's an MTT assay, for those familiar with it).

So the cells are exposed to different formulations at various concentrations for 2 time points (24h and 48h). Consequently, I assume that my experiment has 3 predictor variables...

Statistical test selection

I have a sample of men and women based on their age (21,....76) and I have their willingness (yes, no, not sure/1,2,3). I want to see if the age affects the willingness, or if age shows a significant effect/association with willingness.

Is an independent t test applicable here?

Which statistical test could I use?

For our master's thesis, my partner and I want to construct an event study, but the packages "estudy2" and "eventstudy" do not seem to handle some aspects. I wonder if there is someone on this forum who could help us get around those issues. We are struggling quite a lot, so we would be very thankful for your help!

We have all data on stock returns and market indexes. Variable name for country and the market index is the same.

Specifically, the aspects I can't get...

estudy2 & eventstudy can't offer me what I want

For this research we are looking for a database with data on the employee satisfaction of different companies (preferably within Europe, but not necessarily). We have not found this...

Employee satisfaction database

Is there a good test for overfitting?

This is the code

PROC LOGISTIC DATA=WORK.SORTTempTableSorted

PLOTS(ONLY)=ALL

;

CLASS pd2 (PARAM=REF) pd1 (PARAM=REF) pd3 .....;

MODEL DVD (Event = '1')=pd1 pd2 pd3 pd4 pd5 pd6 pd7 pd8 pd9 pd10 pd11 pd12 pd13 pd15 pd16 pd17 pd18 pd19 pd20 pd21 pd22 pd23 pd24 pd25 pd26 pd27 pd28 pd29 pd30 pd31 pd14 /

SELECTION=NONE

LINK=LOGIT

;

RUN...

Reference coding in SAS Proc Logistics

I am currently working on a project to evaluate the efficacy of one chemical product to reduce the development of a fungal pathogen on a field.

To evaluate this I have 12 plots of 1 m² each; 6 are treated every month and 6 are not. The plots are randomly arranged.

After several months of trial, I now have results every month of the fungal contamination on each plot.

The results are, for example:

Plot A treated, end of July --> 5 fungal infections, covering an area of 20 cm² and...

Statistical analysis of diseases

I'm a research assistant and a new SPSS user and I have a question about a study.

I have collected performance data on several cognitive tasks, before and after 4 pharmacological treatments, from the same group (7 subjects). How can I enter all the data (gender, pharmacological treatment, task results) to observe differences related to the drugs? I want to observe differences between subjects in the PRE and POST conditions. Which kind of analysis can I use? Probably a non-parametric ANOVA test.

Statistical tests for a pre-post study

For...

Analysis for randomized controlled trial with moderating variable?

I am trying to help a classmate with their research and I am experiencing some difficulty. I think the most appropriate test to use would be the Chi-Square test for Independence. The data I was sent is below. I do not know what to place on the rows or columns since the table appears to contain multiple column labels. Would the values included be the observed? If so, what would the expected values be?
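On the question of expected values: in a chi-square test of independence the table entries are the observed counts, and each expected count is (row total × column total) / grand total. The sketch below illustrates this. The classmate's data is not shown here, so the 2×2 table is made up purely for illustration, and the function names are hypothetical.

```python
def expected_counts(observed):
    """Expected cell counts under independence:
    (row total * column total) / grand total."""
    row_totals = [sum(row) for row in observed]
    col_totals = [sum(col) for col in zip(*observed)]
    grand_total = sum(row_totals)
    return [[r * c / grand_total for c in col_totals] for r in row_totals]


def chi_square_statistic(observed):
    """Sum of (observed - expected)^2 / expected over all cells."""
    expected = expected_counts(observed)
    return sum(
        (o - e) ** 2 / e
        for obs_row, exp_row in zip(observed, expected)
        for o, e in zip(obs_row, exp_row)
    )


# Hypothetical counts, made up for illustration only.
# Rows are one categorical variable, columns are the other.
observed = [[20, 30],
            [30, 20]]

expected = expected_counts(observed)     # every cell is 25.0
statistic = chi_square_statistic(observed)  # 4.0
# Degrees of freedom: (rows - 1) * (columns - 1)
dof = (len(observed) - 1) * (len(observed[0]) - 1)
```

Which variable goes on the rows and which on the columns does not change the statistic. What matters is that every respondent is counted in exactly one cell.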

I am in search of some guidance and direction. I work in the field of health and safety risk; forgive me if I sound completely uneducated in the subject of stats, because I am.

I am looking for some statistical methods to provide my work some predictive analysis on health and safety risks... So if I put it in lay terms, I want to look at things such as historical incident and injury statistics and other forms of subjective data like risk assessments on a heat map to provide some...

Stats Novice need help!

"From the above discussion a general principle emerges. Whenever the response variable is informally or formally linked, in an unmasked fashion, to particular parameters that may be deleted from the model, special adjustments must be made in P-values, standard errors, test statistics, and confidence limits, in order for these statistics to have the correct interpretation...

Removing terms from a model

Study Design & Power for Pain Research

I can understand what's going on, except for the last part: how inv(V(kron)U) becomes inv(U) for the multivariate distribution. Is there any source you can point me to on how to get the multivariate pdf from the matrix-variate one?

We also have customers who do not fill out all of the questions. This is missing data. My question is at what point (what percent of data missing) should you start getting concerned? Most of this...

Missing data satisfaction research

Also posted in Sample of a sample - Cross Validated (stackexchange.com)

If we were to carry out a survey on say, workers in a chemical industry asking for responses such as:

- Are you a smoker? Y/N
- Is your BMI over 25? Y/N
- Do you have asthma?
- Were you diagnosed with asthma whilst in the chemical industry?
- Do you have cancer?
- Were you diagnosed with cancer whilst in the chemical industry...

Sample from a sample

I found my answers to the questions asked were uncomfortably arbitrary, and believed the results would probably have to go through some sort of normalization procedure. Something along the lines of weighted scaling of each numerical response, according to frequency of that response. Being mathematically inclined, my dissatisfaction with my own answers irked...

Converting interview response data to aggregate research data

the coin is returned with an additional $1 coin, otherwise the original coin is lost. The probability of winning is 1/2 unless the previous play has resulted in a win, in which case the probability is p < 1/2. If the cost of maintaining the machine averages $c per play (with c < 1/3), give conditions on the value of p that the owner of the machine must arrange in order to make a profit in the long run.
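One way to approach this is to treat the outcome of each play as a two-state Markov chain (previous play won or lost) and find the long-run fraction of wins. The beginning of the problem is missing above, so the sketch below assumes the player stakes a $1 coin per play: the owner gains $1 on a loss and pays out $1 on a win. The function names are hypothetical.

```python
def long_run_win_probability(p):
    """Stationary probability w that a play is a win.

    Balance equation: w = w * p + (1 - w) * 1/2,
    which solves to w = 1 / (3 - 2p).
    """
    return 1 / (3 - 2 * p)


def owner_profit_per_play(p, c):
    """Owner's long-run expected profit per play.

    Assumption: a $1 stake, so the owner gains 1 on a loss,
    pays 1 on a win, and spends c on maintenance.
    """
    w = long_run_win_probability(p)
    return (1 - w) - w - c


def max_p_for_profit(c):
    """Break-even p: solving 1 - 2w - c = 0 for p gives
    p = (1 - 3c) / (2 * (1 - c)); the owner profits when p is below it."""
    return (1 - 3 * c) / (2 * (1 - c))
```

Under these assumptions the owner profits in the long run when p < (1 - 3c) / (2(1 - c)). The bound is positive exactly when c < 1/3, which is consistent with the restriction on c given in the problem.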

Another way is to specify three dummies for the four levels and then specify interaction effects...

Interaction effects

"When the dependent variable is ordinal or continuous, classification through forced up-front dichotomization in an attempt to simplify the problem results in arbitrariness and major information loss even when the optimum...

Making an ordinal response variable binary

Error in unpackPkgZip(foundpkgs[okp, 2L], foundpkgs[okp, 1L], lib, libs_only, :

ERROR: failed to lock directory ‘\\fldvr.net\hq\Users\rthellein\Documents\R\win-library\4.0’ for modifying

Try removing ‘\\fldvr.net\hq\Users\rthellein\Documents\R\win-library\4.0/00LOCK’

I am doing a comparative survival analysis between actual overall survival and the overall survival predicted by a web-based tool.

I need to calculate

I am putting

However, I am not sure which

I can get the results from

Kaplan Meier Survival analysis in SPSS