Bring unrelated columns into a new table

GLS · May 1, 2022, 4:36pm

Hello,
I have an epidemiological dataset that I would like to prepare for analysis.
Currently the data has a group identifier in the first column (CL/P vs CP) and data is contained in the following columns.

Below a short example:

Type	BW	F age birth	M age birth	F age conc	M age conc
CL/P	3060.00	24.41	23.33	23.74	22.58
CP	3080.00	30.91	30.41	30.16	29.58
CP	3820.00	34.41	34.66	33.66	33.91
CP	3620.00	43.91	28.83	43.83	28.83
CL/P	3260.00	35.41	34.16	34.66	33.41
CP	2280.00	36.83	35.33	36.16	34.58
CL/P	4070.00	43.33	34.33	42.58	33.58
CP	3200.00	34.49	31.83	33.74	30.41

To be processed in my statistical package that unfortunately does not support grouping like SPSS, I need to have the data as follows (different data from above):

CLP BW	CLP F age birth	CLP M age birth	CLP F age conc	CLP M age conc	CP BW	CP F age birth	CP M age birth	CP F age conc	CP M age conc
2530	31.91	29.99	31.24	29.33	3080	30.91	30.41	30.16	29.58
3060	24.41	23.33	23.74	22.58	3820	34.41	34.66	33.66	33.91
3260	35.41	34.16	34.66	33.41	3620	43.91	28.83	43.83	28.83
4070	43.33	34.33	42.58	33.58	2280	36.83	35.33	36.16	34.58
3535	36.41	34.16	35.66	33.41	3200	34.49	31.83	33.74	30.41
3750	33.16	35.83	32.41	34.99	2875	39.91	34.58	39.24	33.83
3570	34.66	28.41	33.91	27.66	3180	40.91	35.83	40.16	34.41

The columns will have a different number of rows depending on missing values and differences in frequencies in the groups.

I can filter and segregate the CL/P from the CP, but how can I then bring together the filtered columns from two different branches to create a new table? With a pivot I get to the result for a single variable but I have empty cells that I cannot simply delete shifting up the cells below (wish for a future release…)

Currently I filter for CL/P and export the data and redo for CP.

Is there a smarter way?

Thank you in advance,

GLS

Admin · May 1, 2022, 10:36pm

I don’t understand how you decide which decide which CP and CL/P to put on the same row. Is it just first CP and first CL/P goes on line 1? If so, you can do it like this:

GLS.transform (4.4 KB)

GLS · May 2, 2022, 5:27am

Thank you for your reply - I tried that version already. Unfortunately it keeps only the lowest number of rows (3 of the 5 CP rows, 2 CP missing).

The problem is that CL/P and CP contain data from independent cases (ie: A1= Smith, A2=Johnson, B1=Fischer, B2=Muller, …) Each cell of each of those two column has data from different cases.
And data needs to be arranged as a column with ALL the cases.

Admin · May 2, 2022, 8:39am

I’m still not sure I understand 100% what you want. If the issue is that there are a different number of CP and CL/P rows, then you can fix that in the Join:

Does that help? if not, can you show what the output should look like for a particular input. Then I might get a clearer idea.

GLS · May 2, 2022, 11:10am

THANK YOU very much.
It does exactly what I needed.
Thank you again for EDT!