How do you aggregate a data frame in R without dropping cases that are missing on some (but not all) variables?


Philip Cochetti

You might consider using the dplyr package within the "tidyverse".

Using your example:

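library(tidyverse)

# `df` is a placeholder: the data frame name was lost from the original post
output <- df %>%
  group_by(id1, id2) %>%
  # n() takes no arguments; it counts the rows in each group
  summarize(mn = mean(x), N = n())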

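By default, group_by() keeps rows whose grouping variables are NA and treats them as a group of their own. Note that the .drop argument of group_by() controls whether empty factor levels are dropped, not NA groups; to exclude rows with missing keys, filter them out before grouping, for example with filter(!is.na(id1)).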
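Since the original example data is not shown here, the following is only a sketch on a made-up data frame. The name `df` and its values are hypothetical; `id1`, `id2`, and `x` follow the names used above. It illustrates that rows with a missing grouping key are kept as their own group, and that `na.rm = TRUE` is needed if missing values of `x` should not turn the group mean into NA.

```r
library(dplyr)

# Hypothetical toy data (not from the original question)
df <- data.frame(
  id1 = c("a", "a", "b", "b", NA),
  id2 = c(1, 1, 2, 2, 1),
  x   = c(10, NA, 3, 5, 7)
)

# Rows with a missing id1 are kept and form their own group
output <- df %>%
  group_by(id1, id2) %>%
  summarize(
    mn = mean(x, na.rm = TRUE),  # mean of the non-missing values of x
    N  = sum(!is.na(x)),         # number of non-missing values of x
    .groups = "drop"
  )

# To exclude rows whose grouping key is missing, filter before grouping
output_no_na <- df %>%
  filter(!is.na(id1), !is.na(id2)) %>%
  group_by(id1, id2) %>%
  summarize(
    mn = mean(x, na.rm = TRUE),
    N  = sum(!is.na(x)),
    .groups = "drop"
  )
```

Counting with `sum(!is.na(x))` rather than `n()` is a choice made for this sketch: `n()` would count every row in the group, including those where `x` is missing.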

Dan Lee

Thanks much, Philip Cochetti! I shall try this approach.

Md. Shaddam Hossain Bagmar

You can simply use the subset() function to select the part of the full data set that meets certain conditions.

dfagg1
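The code in this reply is cut off, so the following is only a base R sketch of the general idea, not the poster's actual code. The data frame `df` and the columns `id`, `x`, and `y` are hypothetical. It shows subset() alongside the formula interface of aggregate(), whose default `na.action = na.omit` drops any row that is missing on at least one variable in the formula; passing `na.action = na.pass` together with `na.rm = TRUE` keeps those rows.

```r
# Hypothetical toy data (not from the original question)
df <- data.frame(
  id = c(1, 1, 2, 2),
  x  = c(10, NA, 3, 5),
  y  = c(1, 2, NA, NA)
)

# Default na.action = na.omit: any row missing x or y is dropped first
agg_default <- aggregate(cbind(x, y) ~ id, data = df, FUN = mean)

# na.pass keeps every row; na.rm = TRUE is passed on to mean()
agg_keep <- aggregate(cbind(x, y) ~ id, data = df, FUN = mean,
                      na.rm = TRUE, na.action = na.pass)

# subset() keeps only the rows that satisfy a condition
df_sub <- subset(df, !is.na(x))
```

In this toy data the default call retains only the group with `id` equal to 1, because every row for `id` 2 is missing `y`. With `na.pass`, both groups are returned, and the mean of `y` for `id` 2 is NaN because no values are observed.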