Node crashed after installation of Windows 2003 Service Pack 2

Discussion in 'Clustering' started by Sabo, Eric, Jun 15, 2007.

  1. Sabo, Eric

    Sabo, Eric Guest

    We just installed new MPIO software and Windows 2003 Service Pack 2, and last
    night a node failed right around the time it was running a backup.
    Are there any post-SP2 hotfixes that pertain to clustering?

    Thanks,
    Eric Sabo
     
    Sabo, Eric, Jun 15, 2007
    #1

  2. Sabo, Eric

    Chuck [MSFT] Guest

    Let me be sure I understand this.....

    You just installed new MPIO software and Windows SP2 last night.....and you
    are asking if there are post SP2 cluster hotfixes?

    Have you considered other options before asking for post SP2 fixes for
    clustering? What was the failure on the node?

    Did the MPIO vendor certify that the update you applied works with SP2?

    The only update right now post-SP2 for clusters that we are recommending is
    for storport (Q932755)
     
    Chuck [MSFT], Jun 15, 2007
    #2

  3. Sabo, Eric

    Sabo, Eric Guest

    Chuck,

    Yes - I went to my vendor to ask whether their MPIO software was
    compatible with Service Pack 2, and they said it was. Now I have a whole bunch
    of corruption on my volume. What is weird is that there is corruption on only
    one of my two volumes.

    The nodes were rebooting and generating a memory dump about every 2 hours. We
    uninstalled Service Pack 2 and now they are pretty much stable, but they are
    still rebooting about once every 24 hours.

    Do you think the MPIO software caused the corruption problems?

    How would one run chkdsk on a cluster resource?
     
    Sabo, Eric, Jun 17, 2007
    #3
  4. Sabo, Eric

    M W Guest

    M W, Jun 17, 2007
    #4
  5. Sabo, Eric

    M W Guest

    Hello Eric,

    What is the MPIO software you use? Which vendor is it from?

    Thanks

     
    M W, Jun 17, 2007
    #5
  6. Sabo, Eric

    Chuck [MSFT] Guest

    You'll also want to use maintenance mode for the disk in question as per
    Q903650
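
For illustration only, the sequence could be scripted roughly as below: put the disk resource into maintenance mode, run chkdsk, and always switch maintenance mode back off. This is a minimal sketch, not a tested procedure. The resource name "Disk Q:" and drive letter Q are hypothetical, and the `cluster res ... /maint:on` form is an assumption that should be checked against Q903650 before use. The default is a dry run that only prints the commands.

```python
import subprocess


def maintenance_commands(disk_resource, drive_letter):
    """Build the three commands, in order, as argument lists.

    The /maint:on and /maint:off switches are assumed from Q903650;
    verify them against that article for your Windows version.
    """
    return [
        ["cluster", "res", disk_resource, "/maint:on"],
        ["chkdsk", f"{drive_letter}:", "/f"],
        ["cluster", "res", disk_resource, "/maint:off"],
    ]


def run(cmd, dry_run):
    """Print the command; execute it only when dry_run is False."""
    print(subprocess.list2cmdline(cmd))
    if dry_run:
        return 0
    return subprocess.run(cmd).returncode


def chkdsk_with_maintenance(disk_resource, drive_letter, dry_run=True):
    """Run chkdsk with the disk in maintenance mode; return chkdsk's exit code."""
    on, check, off = maintenance_commands(disk_resource, drive_letter)
    if run(on, dry_run) != 0:
        raise RuntimeError("could not enable maintenance mode")
    try:
        # chkdsk can exit non-zero even when it repairs errors successfully,
        # so the code is returned to the caller instead of raising.
        return run(check, dry_run)
    finally:
        # Always try to leave maintenance mode, even if chkdsk failed.
        run(off, dry_run)


if __name__ == "__main__":
    # Hypothetical names: replace with the real disk resource and drive letter.
    chkdsk_with_maintenance("Disk Q:", "Q", dry_run=True)
```

Turning maintenance mode off in a `finally` block means the disk is not left in maintenance mode if chkdsk is interrupted.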
     
    Chuck [MSFT], Jun 17, 2007
    #6
  7. Wait, you made two major changes last night and you want us to tell you what
    caused an issue? Eric, I know from your various posts on here you have more
    than a passing knowledge of clusters and engineering. But, you really have
    to be kidding here :)

    My bet is on the MPIO drivers by the way :)

    Cheers,

    Rodney R. Fournier

    MVP - Windows Server - Clustering
    http://www.nw-america.com - Clustering Website
    http://msmvps.com/clustering - Blog
    http://www.clusterhelp.com - Cluster Training
    ClusterHelp.com is a Microsoft Certified Gold Partner
     
    Rodney R. Fournier [MVP], Jun 18, 2007
    #7
  8. Sabo, Eric

    Sabo, Eric Guest

    HP..

    I think we have one volume bad within our cluster resources which is causing
    the cluster server to fail.
     
    Sabo, Eric, Jun 18, 2007
    #8
  9. Sabo, Eric

    Sabo, Eric Guest

    Chuck,

    Will users still be able to get to their shares while we run the chkdsk?

    Or does everything except for the disk need to be OFFLINE while chkdsk is
    running?
     
    Sabo, Eric, Jun 18, 2007
    #9
  10. Sabo, Eric

    Chuck [MSFT] Guest

    Nope, you will need to schedule an outage, and I recommend the other resources in
    the group be taken offline so no attempts can be made to access the disk.
     
    Chuck [MSFT], Jun 19, 2007
    #10
  11. Sabo, Eric

    Sabo, Eric Guest

    Rodney,

    It was just plain bad luck we had here; it was the corrupted
    cluster resource that was causing the memory.dmp on both nodes.

    Well, the MPIO drivers need a certain firmware to run against the MSA
    1000, which the vendor forgot to tell us.

    Anyway, things are much more stable now.
     
    Sabo, Eric, Jun 19, 2007
    #11
  12. Rodney R. Fournier [MVP], Jun 19, 2007
    #12
  13. Sabo, Eric

    Roderick Guest

    I am installing MPIO Basic for the MSA 1000. How can I be sure everything
    is working correctly? Is there a management console or something to look at?
    I have 2 Emulex HBAs, one to each controller on the MSA 1000.

    vr,
    Roderick
     
    Roderick, Jun 26, 2007
    #13
  14. Start a large file transfer and pull a fibre cable. If it continues to flow
    you are using MPIO. Then put everything back and pull the other cable. The
    vendor should have utilities for you to use.
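
If no large file is handy, the transfer half of this test can be approximated with a small script. The sketch below is only an illustration: it writes to a file in chunks, forces each chunk to disk, and records any chunk that took unusually long. The target path, sizes, and 5-second stall threshold are hypothetical and should be adjusted. A short stall followed by continued writes while a cable is pulled would suggest the path failed over; an I/O error would suggest it did not.

```python
import os
import tempfile
import time


def timed_write(path, total_bytes, chunk_bytes=1024 * 1024, stall_seconds=5.0):
    """Write total_bytes to path in chunks; return (bytes_written, stalls).

    stalls lists (chunk_index, seconds) for every chunk whose write plus
    fsync took longer than stall_seconds.
    """
    chunk = b"\0" * chunk_bytes
    written = 0
    index = 0
    stalls = []
    with open(path, "wb") as f:
        while written < total_bytes:
            size = min(chunk_bytes, total_bytes - written)
            start = time.monotonic()
            f.write(chunk[:size])
            f.flush()
            os.fsync(f.fileno())  # push data to the device, not just the cache
            elapsed = time.monotonic() - start
            if elapsed > stall_seconds:
                stalls.append((index, elapsed))
            written += size
            index += 1
    return written, stalls


if __name__ == "__main__":
    # Hypothetical target: point this at a file on the MPIO-backed volume.
    target = os.path.join(tempfile.gettempdir(), "mpio_test.bin")
    written, stalls = timed_write(target, 16 * 1024 * 1024)
    print(f"wrote {written} bytes, {len(stalls)} stalled chunks")
    os.remove(target)
```

For a real test, make the total size large enough that the write is still running while each cable is pulled and reconnected.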

    Cheers,

    Rodney R. Fournier
    MVP - Windows Server - Clustering
    http://www.nw-america.com - Clustering Website
    http://msmvps.com/clustering - Blog
    http://www.clusterhelp.com - Cluster Training
    ClusterHelp.com is a Microsoft Certified Gold Partner
     
    Rodney R. Fournier [MVP], Jun 26, 2007
    #14
  15. Sabo, Eric

    Roderick Guest

    That was going to be my test. HP doesn't have a management utility for their
    MPIO Basic DSM for the MSA 1000. Thanks

    Roderick
     
    Roderick, Jun 26, 2007
    #15