is there any data tidying tool for python/pandas similar to R tidyr tool?












14












$begingroup$


I'm working on a Kaggle challenge where some variables are represented by rows instead of columns (Telstra Network Disruption). I am currently searching for the equivalent of gather(), separate() and spread(), which can be found in R tidyr tool.










share|improve this question









$endgroup$








  • 1




    $begingroup$
    github.com/pydata/pandas/issues/10109
    $endgroup$
    – Emre
    Mar 2 '16 at 16:29
















14












$begingroup$


I'm working on a Kaggle challenge where some variables are represented by rows instead of columns (Telstra Network Disruption). I am currently searching for the equivalent of gather(), separate() and spread(), which can be found in R tidyr tool.










share|improve this question









$endgroup$








  • 1




    $begingroup$
    github.com/pydata/pandas/issues/10109
    $endgroup$
    – Emre
    Mar 2 '16 at 16:29














14












14








14


2



$begingroup$


I'm working on a Kaggle challenge where some variables are represented by rows instead of columns (Telstra Network Disruption). I am currently searching for the equivalent of gather(), separate() and spread(), which can be found in R tidyr tool.










share|improve this question









$endgroup$




I'm working on a Kaggle challenge where some variables are represented by rows instead of columns (Telstra Network Disruption). I am currently searching for the equivalent of gather(), separate() and spread(), which can be found in R tidyr tool.







r python dataset data-cleaning pandas






share|improve this question













share|improve this question











share|improve this question




share|improve this question










asked Mar 2 '16 at 8:54









cpumarcpumar

392313




392313








  • 1




    $begingroup$
    github.com/pydata/pandas/issues/10109
    $endgroup$
    – Emre
    Mar 2 '16 at 16:29














  • 1




    $begingroup$
    github.com/pydata/pandas/issues/10109
    $endgroup$
    – Emre
    Mar 2 '16 at 16:29








1




1




$begingroup$
github.com/pydata/pandas/issues/10109
$endgroup$
– Emre
Mar 2 '16 at 16:29




$begingroup$
github.com/pydata/pandas/issues/10109
$endgroup$
– Emre
Mar 2 '16 at 16:29










3 Answers
3






active

oldest

votes


















4












$begingroup$

I'd start with the melt() function in pandas. I wrote an article about it:



https://www.ibm.com/developerworks/community/blogs/jfp/entry/Tidy_Data_In_Python?lang=en






share|improve this answer











$endgroup$





















    2












    $begingroup$

    R's gather() essentially goes from wide to long. So,




    1. check pandas page for how to use pandas.wide_to_long(),

    2. check this blog for a discussion on getting an elegant gather-like function in Python.






    share|improve this answer









    $endgroup$





















      0












      $begingroup$

      I tried to syntactically mimic the tidyr package in python in a package called tidypython. I made it compatible with the dplython package, which includes usage of the >> operator for chaining commands.



      It hasn't been fully tested, but should work pretty well:



      https://github.com/durrantmm/tidypython



      Let me know if it works for you.






      share|improve this answer








      New contributor




      Matt Durrant is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
      Check out our Code of Conduct.






      $endgroup$













        Your Answer





        StackExchange.ifUsing("editor", function () {
        return StackExchange.using("mathjaxEditing", function () {
        StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
        StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
        });
        });
        }, "mathjax-editing");

        StackExchange.ready(function() {
        var channelOptions = {
        tags: "".split(" "),
        id: "557"
        };
        initTagRenderer("".split(" "), "".split(" "), channelOptions);

        StackExchange.using("externalEditor", function() {
        // Have to fire editor after snippets, if snippets enabled
        if (StackExchange.settings.snippets.snippetsEnabled) {
        StackExchange.using("snippets", function() {
        createEditor();
        });
        }
        else {
        createEditor();
        }
        });

        function createEditor() {
        StackExchange.prepareEditor({
        heartbeatType: 'answer',
        autoActivateHeartbeat: false,
        convertImagesToLinks: false,
        noModals: true,
        showLowRepImageUploadWarning: true,
        reputationToPostImages: null,
        bindNavPrevention: true,
        postfix: "",
        imageUploader: {
        brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
        contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
        allowUrls: true
        },
        onDemand: true,
        discardSelector: ".discard-answer"
        ,immediatelyShowMarkdownHelp:true
        });


        }
        });














        draft saved

        draft discarded


















        StackExchange.ready(
        function () {
        StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f10478%2fis-there-any-data-tidying-tool-for-python-pandas-similar-to-r-tidyr-tool%23new-answer', 'question_page');
        }
        );

        Post as a guest















        Required, but never shown

























        3 Answers
        3






        active

        oldest

        votes








        3 Answers
        3






        active

        oldest

        votes









        active

        oldest

        votes






        active

        oldest

        votes









        4












        $begingroup$

        I'd start with the melt() function in pandas. I wrote an article about it:



        https://www.ibm.com/developerworks/community/blogs/jfp/entry/Tidy_Data_In_Python?lang=en






        share|improve this answer











        $endgroup$


















          4












          $begingroup$

          I'd start with the melt() function in pandas. I wrote an article about it:



          https://www.ibm.com/developerworks/community/blogs/jfp/entry/Tidy_Data_In_Python?lang=en






          share|improve this answer











          $endgroup$
















            4












            4








            4





            $begingroup$

            I'd start with the melt() function in pandas. I wrote an article about it:



            https://www.ibm.com/developerworks/community/blogs/jfp/entry/Tidy_Data_In_Python?lang=en






            share|improve this answer











            $endgroup$



            I'd start with the melt() function in pandas. I wrote an article about it:



            https://www.ibm.com/developerworks/community/blogs/jfp/entry/Tidy_Data_In_Python?lang=en







            share|improve this answer














            share|improve this answer



            share|improve this answer








            edited Mar 26 '16 at 10:33

























            answered Mar 15 '16 at 17:54









            JFPJFP

            413




            413























                2












                $begingroup$

                R's gather() essentially goes from wide to long. So,




                1. check pandas page for how to use pandas.wide_to_long(),

                2. check this blog for a discussion on getting an elegant gather-like function in Python.






                share|improve this answer









                $endgroup$


















                  2












                  $begingroup$

                  R's gather() essentially goes from wide to long. So,




                  1. check pandas page for how to use pandas.wide_to_long(),

                  2. check this blog for a discussion on getting an elegant gather-like function in Python.






                  share|improve this answer









                  $endgroup$
















                    2












                    2








                    2





                    $begingroup$

                    R's gather() essentially goes from wide to long. So,




                    1. check pandas page for how to use pandas.wide_to_long(),

                    2. check this blog for a discussion on getting an elegant gather-like function in Python.






                    share|improve this answer









                    $endgroup$



                    R's gather() essentially goes from wide to long. So,




                    1. check pandas page for how to use pandas.wide_to_long(),

                    2. check this blog for a discussion on getting an elegant gather-like function in Python.







                    share|improve this answer












                    share|improve this answer



                    share|improve this answer










                    answered Aug 30 '16 at 15:13









                    ximikiximiki

                    4271512




                    4271512























                        0












                        $begingroup$

                        I tried to syntactically mimic the tidyr package in python in a package called tidypython. I made it compatible with the dplython package, which includes usage of the >> operator for chaining commands.



                        It hasn't been fully tested, but should work pretty well:



                        https://github.com/durrantmm/tidypython



                        Let me know if it works for you.






                        share|improve this answer








                        New contributor




                        Matt Durrant is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                        Check out our Code of Conduct.






                        $endgroup$


















                          0












                          $begingroup$

                          I tried to syntactically mimic the tidyr package in python in a package called tidypython. I made it compatible with the dplython package, which includes usage of the >> operator for chaining commands.



                          It hasn't been fully tested, but should work pretty well:



                          https://github.com/durrantmm/tidypython



                          Let me know if it works for you.






                          share|improve this answer








                          New contributor




                          Matt Durrant is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                          Check out our Code of Conduct.






                          $endgroup$
















                            0












                            0








                            0





                            $begingroup$

                            I tried to syntactically mimic the tidyr package in python in a package called tidypython. I made it compatible with the dplython package, which includes usage of the >> operator for chaining commands.



                            It hasn't been fully tested, but should work pretty well:



                            https://github.com/durrantmm/tidypython



                            Let me know if it works for you.






                            share|improve this answer








                            New contributor




                            Matt Durrant is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                            Check out our Code of Conduct.






                            $endgroup$



                            I tried to syntactically mimic the tidyr package in python in a package called tidypython. I made it compatible with the dplython package, which includes usage of the >> operator for chaining commands.



                            It hasn't been fully tested, but should work pretty well:



                            https://github.com/durrantmm/tidypython



                            Let me know if it works for you.







                            share|improve this answer








                            New contributor




                            Matt Durrant is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                            Check out our Code of Conduct.









                            share|improve this answer



                            share|improve this answer






                            New contributor




                            Matt Durrant is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                            Check out our Code of Conduct.









                            answered 21 mins ago









                            Matt DurrantMatt Durrant

                            1




                            1




                            New contributor




                            Matt Durrant is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                            Check out our Code of Conduct.





                            New contributor





                            Matt Durrant is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                            Check out our Code of Conduct.






                            Matt Durrant is a new contributor to this site. Take care in asking for clarification, commenting, and answering.
                            Check out our Code of Conduct.






























                                draft saved

                                draft discarded




















































                                Thanks for contributing an answer to Data Science Stack Exchange!


                                • Please be sure to answer the question. Provide details and share your research!

                                But avoid



                                • Asking for help, clarification, or responding to other answers.

                                • Making statements based on opinion; back them up with references or personal experience.


                                Use MathJax to format equations. MathJax reference.


                                To learn more, see our tips on writing great answers.




                                draft saved


                                draft discarded














                                StackExchange.ready(
                                function () {
                                StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f10478%2fis-there-any-data-tidying-tool-for-python-pandas-similar-to-r-tidyr-tool%23new-answer', 'question_page');
                                }
                                );

                                Post as a guest















                                Required, but never shown





















































                                Required, but never shown














                                Required, but never shown












                                Required, but never shown







                                Required, but never shown

































                                Required, but never shown














                                Required, but never shown












                                Required, but never shown







                                Required, but never shown







                                Popular posts from this blog

                                Ponta tanko

                                Tantalo (mitologio)

                                Erzsébet Schaár