Which statistical model should I run to get a good model












1












$begingroup$


I'm working on an analysis to analyze projected man-hours vs actual man-hours used for different teams. My data looks like this below. The software I am using is SAS. I've tried a HP linear regression taking the difference from actual man-hours minus projected man-hours and vice versa. I also ran an Anova. Both models p-value were 0.08. I'm looking for a solid model < 0.05 in my analysis of variance.



enter image description here










share|improve this question









$endgroup$












  • $begingroup$
    could you add a bit more information? why do you think such a model exists? do you have any other data about the projects/teams that could be used for modeling?
    $endgroup$
    – oW_
    5 hours ago










  • $begingroup$
    Hi Chris, welcome to the community. If you want to add more information to your question, please can edit your original question instead of posting answers. "answers" are for answers :)
    $endgroup$
    – Louis T
    1 hour ago
















1












$begingroup$


I'm working on an analysis to analyze projected man-hours vs actual man-hours used for different teams. My data looks like this below. The software I am using is SAS. I've tried a HP linear regression taking the difference from actual man-hours minus projected man-hours and vice versa. I also ran an Anova. Both models p-value were 0.08. I'm looking for a solid model < 0.05 in my analysis of variance.



enter image description here










share|improve this question









$endgroup$












  • $begingroup$
    could you add a bit more information? why do you think such a model exists? do you have any other data about the projects/teams that could be used for modeling?
    $endgroup$
    – oW_
    5 hours ago










  • $begingroup$
    Hi Chris, welcome to the community. If you want to add more information to your question, please can edit your original question instead of posting answers. "answers" are for answers :)
    $endgroup$
    – Louis T
    1 hour ago














1












1








1





$begingroup$


I'm working on an analysis to analyze projected man-hours vs actual man-hours used for different teams. My data looks like this below. The software I am using is SAS. I've tried a HP linear regression taking the difference from actual man-hours minus projected man-hours and vice versa. I also ran an Anova. Both models p-value were 0.08. I'm looking for a solid model < 0.05 in my analysis of variance.



enter image description here










share|improve this question









$endgroup$




I'm working on an analysis to analyze projected man-hours vs actual man-hours used for different teams. My data looks like this below. The software I am using is SAS. I've tried a HP linear regression taking the difference from actual man-hours minus projected man-hours and vice versa. I also ran an Anova. Both models p-value were 0.08. I'm looking for a solid model < 0.05 in my analysis of variance.



enter image description here







dataset statistics linear-regression descriptive-statistics






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked 6 hours ago









Chris KehlChris Kehl

559




559












  • $begingroup$
    could you add a bit more information? why do you think such a model exists? do you have any other data about the projects/teams that could be used for modeling?
    $endgroup$
    – oW_
    5 hours ago










  • $begingroup$
    Hi Chris, welcome to the community. If you want to add more information to your question, please can edit your original question instead of posting answers. "answers" are for answers :)
    $endgroup$
    – Louis T
    1 hour ago


















  • $begingroup$
    could you add a bit more information? why do you think such a model exists? do you have any other data about the projects/teams that could be used for modeling?
    $endgroup$
    – oW_
    5 hours ago










  • $begingroup$
    Hi Chris, welcome to the community. If you want to add more information to your question, please can edit your original question instead of posting answers. "answers" are for answers :)
    $endgroup$
    – Louis T
    1 hour ago
















$begingroup$
could you add a bit more information? why do you think such a model exists? do you have any other data about the projects/teams that could be used for modeling?
$endgroup$
– oW_
5 hours ago




$begingroup$
could you add a bit more information? why do you think such a model exists? do you have any other data about the projects/teams that could be used for modeling?
$endgroup$
– oW_
5 hours ago












$begingroup$
Hi Chris, welcome to the community. If you want to add more information to your question, please can edit your original question instead of posting answers. "answers" are for answers :)
$endgroup$
– Louis T
1 hour ago




$begingroup$
Hi Chris, welcome to the community. If you want to add more information to your question, please can edit your original question instead of posting answers. "answers" are for answers :)
$endgroup$
– Louis T
1 hour ago










4 Answers
4






active

oldest

votes


















0












$begingroup$

Nope, that’s all I want to analyze. I’m looking to see which teams are hitting the estimated time. The closer the difference is to zero the better. I also want to have visualizations to demonstrate the differences between each team. If anyone has ever analyzed the mtcars data set, where the mpg was the dependent variable and cyl was the indépendent and performed an Anova or HP linear regression, this is what I’m looking for.






share|improve this answer









$endgroup$





















    0












    $begingroup$

    This is the result of my model as a whole.enter image description here






    share|improve this answer









    $endgroup$













    • $begingroup$
      I'm trying to figure out if my model is usable.
      $endgroup$
      – Chris Kehl
      3 hours ago










    • $begingroup$
      not liking the F-value and the p-value 0.08
      $endgroup$
      – Chris Kehl
      3 hours ago



















    0












    $begingroup$

    Help me figure this out, it's from the same ANOVA but it's Levene's test of difference and two other tests. These models have a positive f-value and a p-value
    of 0.05 or less.



    enter image description here






    share|improve this answer









    $endgroup$





















      0












      $begingroup$

      As I analyze the work man-hours projected vs the man-hours actually performed, the ideal result would be zero. My plots show that there are are some work-orders close to zero, some are higher and some are lower, with a significant amount of outliers with some teams. I've posted the dataset on Kaggle https://www.kaggle.com/hashtagkehl/work-analysis



      enter image description here






      share|improve this answer









      $endgroup$













        Your Answer





        StackExchange.ifUsing("editor", function () {
        return StackExchange.using("mathjaxEditing", function () {
        StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
        StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
        });
        });
        }, "mathjax-editing");

        StackExchange.ready(function() {
        var channelOptions = {
        tags: "".split(" "),
        id: "557"
        };
        initTagRenderer("".split(" "), "".split(" "), channelOptions);

        StackExchange.using("externalEditor", function() {
        // Have to fire editor after snippets, if snippets enabled
        if (StackExchange.settings.snippets.snippetsEnabled) {
        StackExchange.using("snippets", function() {
        createEditor();
        });
        }
        else {
        createEditor();
        }
        });

        function createEditor() {
        StackExchange.prepareEditor({
        heartbeatType: 'answer',
        autoActivateHeartbeat: false,
        convertImagesToLinks: false,
        noModals: true,
        showLowRepImageUploadWarning: true,
        reputationToPostImages: null,
        bindNavPrevention: true,
        postfix: "",
        imageUploader: {
        brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
        contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
        allowUrls: true
        },
        onDemand: true,
        discardSelector: ".discard-answer"
        ,immediatelyShowMarkdownHelp:true
        });


        }
        });














        draft saved

        draft discarded


















        StackExchange.ready(
        function () {
        StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f45414%2fwhich-statistical-model-should-i-run-to-get-a-good-model%23new-answer', 'question_page');
        }
        );

        Post as a guest















        Required, but never shown

























        4 Answers
        4






        active

        oldest

        votes








        4 Answers
        4






        active

        oldest

        votes









        active

        oldest

        votes






        active

        oldest

        votes









        0












        $begingroup$

        Nope, that’s all I want to analyze. I’m looking to see which teams are hitting the estimated time. The closer the difference is to zero the better. I also want to have visualizations to demonstrate the differences between each team. If anyone has ever analyzed the mtcars data set, where the mpg was the dependent variable and cyl was the indépendent and performed an Anova or HP linear regression, this is what I’m looking for.






        share|improve this answer









        $endgroup$


















          0












          $begingroup$

          Nope, that’s all I want to analyze. I’m looking to see which teams are hitting the estimated time. The closer the difference is to zero the better. I also want to have visualizations to demonstrate the differences between each team. If anyone has ever analyzed the mtcars data set, where the mpg was the dependent variable and cyl was the indépendent and performed an Anova or HP linear regression, this is what I’m looking for.






          share|improve this answer









          $endgroup$
















            0












            0








            0





            $begingroup$

            Nope, that’s all I want to analyze. I’m looking to see which teams are hitting the estimated time. The closer the difference is to zero the better. I also want to have visualizations to demonstrate the differences between each team. If anyone has ever analyzed the mtcars data set, where the mpg was the dependent variable and cyl was the indépendent and performed an Anova or HP linear regression, this is what I’m looking for.






            share|improve this answer









            $endgroup$



            Nope, that’s all I want to analyze. I’m looking to see which teams are hitting the estimated time. The closer the difference is to zero the better. I also want to have visualizations to demonstrate the differences between each team. If anyone has ever analyzed the mtcars data set, where the mpg was the dependent variable and cyl was the indépendent and performed an Anova or HP linear regression, this is what I’m looking for.







            share|improve this answer












            share|improve this answer



            share|improve this answer










            answered 5 hours ago









            Chris KehlChris Kehl

            559




            559























                0












                $begingroup$

                This is the result of my model as a whole.enter image description here






                share|improve this answer









                $endgroup$













                • $begingroup$
                  I'm trying to figure out if my model is usable.
                  $endgroup$
                  – Chris Kehl
                  3 hours ago










                • $begingroup$
                  not liking the F-value and the p-value 0.08
                  $endgroup$
                  – Chris Kehl
                  3 hours ago
















                0












                $begingroup$

                This is the result of my model as a whole.enter image description here






                share|improve this answer









                $endgroup$













                • $begingroup$
                  I'm trying to figure out if my model is usable.
                  $endgroup$
                  – Chris Kehl
                  3 hours ago










                • $begingroup$
                  not liking the F-value and the p-value 0.08
                  $endgroup$
                  – Chris Kehl
                  3 hours ago














                0












                0








                0





                $begingroup$

                This is the result of my model as a whole.enter image description here






                share|improve this answer









                $endgroup$



                This is the result of my model as a whole.enter image description here







                share|improve this answer












                share|improve this answer



                share|improve this answer










                answered 3 hours ago









                Chris KehlChris Kehl

                559




                559












                • $begingroup$
                  I'm trying to figure out if my model is usable.
                  $endgroup$
                  – Chris Kehl
                  3 hours ago










                • $begingroup$
                  not liking the F-value and the p-value 0.08
                  $endgroup$
                  – Chris Kehl
                  3 hours ago


















                • $begingroup$
                  I'm trying to figure out if my model is usable.
                  $endgroup$
                  – Chris Kehl
                  3 hours ago










                • $begingroup$
                  not liking the F-value and the p-value 0.08
                  $endgroup$
                  – Chris Kehl
                  3 hours ago
















                $begingroup$
                I'm trying to figure out if my model is usable.
                $endgroup$
                – Chris Kehl
                3 hours ago




                $begingroup$
                I'm trying to figure out if my model is usable.
                $endgroup$
                – Chris Kehl
                3 hours ago












                $begingroup$
                not liking the F-value and the p-value 0.08
                $endgroup$
                – Chris Kehl
                3 hours ago




                $begingroup$
                not liking the F-value and the p-value 0.08
                $endgroup$
                – Chris Kehl
                3 hours ago











                0












                $begingroup$

                Help me figure this out, it's from the same ANOVA but it's Levene's test of difference and two other tests. These models have a positive f-value and a p-value
                of 0.05 or less.



                enter image description here






                share|improve this answer









                $endgroup$


















                  0












                  $begingroup$

                  Help me figure this out, it's from the same ANOVA but it's Levene's test of difference and two other tests. These models have a positive f-value and a p-value
                  of 0.05 or less.



                  enter image description here






                  share|improve this answer









                  $endgroup$
















                    0












                    0








                    0





                    $begingroup$

                    Help me figure this out, it's from the same ANOVA but it's Levene's test of difference and two other tests. These models have a positive f-value and a p-value
                    of 0.05 or less.



                    enter image description here






                    share|improve this answer









                    $endgroup$



                    Help me figure this out, it's from the same ANOVA but it's Levene's test of difference and two other tests. These models have a positive f-value and a p-value
                    of 0.05 or less.



                    enter image description here







                    share|improve this answer












                    share|improve this answer



                    share|improve this answer










                    answered 3 hours ago









                    Chris KehlChris Kehl

                    559




                    559























                        0












                        $begingroup$

                        As I analyze the work man-hours projected vs the man-hours actually performed, the ideal result would be zero. My plots show that there are are some work-orders close to zero, some are higher and some are lower, with a significant amount of outliers with some teams. I've posted the dataset on Kaggle https://www.kaggle.com/hashtagkehl/work-analysis



                        enter image description here






                        share|improve this answer









                        $endgroup$


















                          0












                          $begingroup$

                          As I analyze the work man-hours projected vs the man-hours actually performed, the ideal result would be zero. My plots show that there are are some work-orders close to zero, some are higher and some are lower, with a significant amount of outliers with some teams. I've posted the dataset on Kaggle https://www.kaggle.com/hashtagkehl/work-analysis



                          enter image description here






                          share|improve this answer









                          $endgroup$
















                            0












                            0








                            0





                            $begingroup$

                            As I analyze the work man-hours projected vs the man-hours actually performed, the ideal result would be zero. My plots show that there are are some work-orders close to zero, some are higher and some are lower, with a significant amount of outliers with some teams. I've posted the dataset on Kaggle https://www.kaggle.com/hashtagkehl/work-analysis



                            enter image description here






                            share|improve this answer









                            $endgroup$



                            As I analyze the work man-hours projected vs the man-hours actually performed, the ideal result would be zero. My plots show that there are are some work-orders close to zero, some are higher and some are lower, with a significant amount of outliers with some teams. I've posted the dataset on Kaggle https://www.kaggle.com/hashtagkehl/work-analysis



                            enter image description here







                            share|improve this answer












                            share|improve this answer



                            share|improve this answer










                            answered 3 hours ago









                            Chris KehlChris Kehl

                            559




                            559






























                                draft saved

                                draft discarded




















































                                Thanks for contributing an answer to Data Science Stack Exchange!


                                • Please be sure to answer the question. Provide details and share your research!

                                But avoid



                                • Asking for help, clarification, or responding to other answers.

                                • Making statements based on opinion; back them up with references or personal experience.


                                Use MathJax to format equations. MathJax reference.


                                To learn more, see our tips on writing great answers.




                                draft saved


                                draft discarded














                                StackExchange.ready(
                                function () {
                                StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f45414%2fwhich-statistical-model-should-i-run-to-get-a-good-model%23new-answer', 'question_page');
                                }
                                );

                                Post as a guest















                                Required, but never shown





















































                                Required, but never shown














                                Required, but never shown












                                Required, but never shown







                                Required, but never shown

































                                Required, but never shown














                                Required, but never shown












                                Required, but never shown







                                Required, but never shown







                                Popular posts from this blog

                                Ponta tanko

                                Tantalo (mitologio)

                                Erzsébet Schaár