sas - Analyze Repeated Measures Data Using PROC GLIMMIX

Collectives™ on Stack Overflow

Find centralized, trusted content and collaborate around the technologies you use most.

Learn more about Collectives

Teams

Q&A for work

Connect and share knowledge within a single location that is structured and easy to search.

Learn more about Teams

I am using PROC GLIMMIX to analyze repeated measures data about specific sexual events. The original data came from a weekly diary study of about 400 people. During each week they reported on behaviours from their most recent sexual encounter. We also have basline data on their demographics. 12 weeks of observation were collected and we had a high completion rate.

I would like to create a mixed effect model, but I am unsure exactly how this is done in SAS. I want to test the effect of event-specific factors as well as some person level demographics and would like to get odds ratios for each factor of interest. The outcome is whether or not drugs were used during the event and the explanatory factors will be things like age, gender, etc. as well as characteristics about the event (i.e., partner HIV status), whether the partner was a regular sexual partner, etc..

The code I'm working with follows this pattern:

PROC GLIMMIX DATA=work.dataset oddsratio; CLASS VISIT_NUMBER PARTICIPANT_ID BINARY_EVENTLEVEL_OUTCOME BINARY_EVENTLEVEL_EXPLANATORY_FACTOR CATEGORICAL_PERSONLEVEL_EXPLANATORY_FACTOR; MODEL BINARY_EVENTLEVEL_OUTCOME = BINARY_EVENTLEVEL_EXPLANATORY CATEGORICAL_PERSONLEVEL_EXPLANATORY_FACTOR /DIST=binary link=logit CL S ddfm=kr; RANDOM ?????;


    option 1 for ?????:
    
     residual
    
    / subject=PARTICIPANT_ID


   
    option 2 for ?????: INTERCEPT / subject=PARTICIPANT_ID
   
   
    option 3 for ?????: VISIT_NUM / subject=PARTICIPANT_ID residual type=ar(1)
                   INTERCEPT / subject=VISIT_NUM(PARTICIPANT_ID)
   
   
    option 4 for ?????: Other?
   
   
    I am also unclear whether I should use ddfm=kr in my model statement or method=laplace in my proc statement -- both have been recommended elsewhere for this sort of repeated measures analysis.
   
   
    I've come across several potential options for modelling this which often give similar results, but option 1 gives a statistically significant result for an event-level, while the others give non-significant results. The inclusion of the ddfm=kr makes the result of interest more significant. The method=laplace does not allow for option 1.
   
   
    I may not be answering your question, but might be able to provide a couple of directions:
   
   
    To start with the simplest part, your
    
     MODEL
    
    statement looks correct to me as you want to test
    
     event-level factors
    
    and
    
     person-level demographics
    
    which are thus considered as
    
     fixed effects
    
    .
   
   
    Now, as far as the
    
     random effects
    
    are concerned:
   
   
    the
    
     RANDOM
    
    statements you propose for
    
     options (1) and (2)
    
    :
    

    (1)
    
     RANDOM _residual_ / subject=PARTICIPANT_ID;
    
    

    (2)
    
     RANDOM intercept / subject=PARTICIPANT_ID;
    
    

    are modeling two different parts of the random effects: the
    
     R-side
    
    and the
    
     G-side
    
    , respectively.
    

    If you are already familiar with
    
     PROC MIXED
    
    , you may want to notice that your option (1) of using
    
     RANDOM _residual_
    
    in
    
     PROC GLIMMIX
    
    is equivalent to using the
    
     REPEATED
    
    statement in
    
     PROC MIXED
    
    that tells that you have repeated measures for
    
     PARTICIPANT_ID
    
    , which is clearly your case (Ref:
    
     "Comparing the GLIMMIX and MIXED Procedures"
    
    )
   
   
    As for
    
     option (3)
    
    :
    

    
     RANDOM VISIT_NUM / subject=PARTICIPANT_ID residual type=ar(1) INTERCEPT / subject=VISIT_NUM(PARTICIPANT_ID);
    
    

    here you are modeling the time component of the repeated measures (
    
     visit_num
    
    ) as a random effect, and this should be included when you believe that there would be a random variation of the response at each of the measurements times (i.e. at each event). At first glance, I don't believe this is relevant in your case, since you are taking this into account already by the fixed effects... but of course I may be wrong by not seeing your data.
   
   
    Up to here is what I can contribute at this time.
   
   
    As
    
     next steps
    
    for you to have a better understanding, I would suggest that you:
   
   
    Read the Overview of the
    
     PROC GLIMMIX
    
    documentation, in particular the
    
     mathematical model specification
    
    and all 3 sections therein:
    

    
     The Basic Model
    
    

    
     G-Side and R-Side Random Effects and Covariance Structures
    
    

    
     Relationship with Generalized Linear Models
    
   
   
    If you are still unsure, ask your question at
    
     communities.sas.com
    
    which might be able to help you better.
   
   
    Thanks for contributing an answer to Stack Overflow!
   
   
    
     Please be sure to
     
      answer the question
     
     . Provide details and share your research!
    
   
   
    But
    
     avoid
    
    …
   
   
    
     Asking for help, clarification, or responding to other answers.
    
    
     Making statements based on opinion; back them up with references or personal experience.
    
   
   
    To learn more, see our
    
     tips on writing great answers
    
    .