Is There any RNN method used for Object detection
$begingroup$
after reading the state of the art about object detection using CNN (R-CNN Faster R-CNN ,YOLO, SSD...) I was wondering if there is a method that use RNN's or that combine the use of CNN's and RNN's for object detection ??
Thank you
neural-network deep-learning convnet computer-vision recurrent-neural-net
$endgroup$
add a comment |
$begingroup$
after reading the state of the art about object detection using CNN (R-CNN Faster R-CNN ,YOLO, SSD...) I was wondering if there is a method that use RNN's or that combine the use of CNN's and RNN's for object detection ??
Thank you
neural-network deep-learning convnet computer-vision recurrent-neural-net
$endgroup$
$begingroup$
Recurrent YOLO for object tracking - Project page - paper: enter link description here - Tensorflow implementation
$endgroup$
– Alireza Akhavan
Oct 10 '18 at 5:37
add a comment |
$begingroup$
after reading the state of the art about object detection using CNN (R-CNN Faster R-CNN ,YOLO, SSD...) I was wondering if there is a method that use RNN's or that combine the use of CNN's and RNN's for object detection ??
Thank you
neural-network deep-learning convnet computer-vision recurrent-neural-net
$endgroup$
after reading the state of the art about object detection using CNN (R-CNN Faster R-CNN ,YOLO, SSD...) I was wondering if there is a method that use RNN's or that combine the use of CNN's and RNN's for object detection ??
Thank you
neural-network deep-learning convnet computer-vision recurrent-neural-net
neural-network deep-learning convnet computer-vision recurrent-neural-net
asked Feb 21 '18 at 11:00
hbdzhbdz
769
769
$begingroup$
Recurrent YOLO for object tracking - Project page - paper: enter link description here - Tensorflow implementation
$endgroup$
– Alireza Akhavan
Oct 10 '18 at 5:37
add a comment |
$begingroup$
Recurrent YOLO for object tracking - Project page - paper: enter link description here - Tensorflow implementation
$endgroup$
– Alireza Akhavan
Oct 10 '18 at 5:37
$begingroup$
Recurrent YOLO for object tracking - Project page - paper: enter link description here - Tensorflow implementation
$endgroup$
– Alireza Akhavan
Oct 10 '18 at 5:37
$begingroup$
Recurrent YOLO for object tracking - Project page - paper: enter link description here - Tensorflow implementation
$endgroup$
– Alireza Akhavan
Oct 10 '18 at 5:37
add a comment |
3 Answers
3
active
oldest
votes
$begingroup$
Yes, there have been many attempts, but perhaps the most noteable one is the approach described in the paper of Andrej Karpathy and Li Fei-Fei where they connect a CNN and RNN in series (CNN over image region + bidirectional RNN + Multimodal RNN) and use this for labeling a scene with a whole sentence. Though, this one is more than just object detection as it leverages a data set of scenes and their descriptions to generate natural language descriptions of new unseen images.
Another example is Ming Liang and Xiaolin Hu's approche where they mix a CNN with an RNN and use this architecture for better object detection. As Ming and Xiaolin explained in their paper (linked above), the RNN is used to improve the CNN:
A prominent difference is that CNN is typically a feed-forward architecture while in the visual system recurrent connections are abundant. Inspired by this fact, we propose a recurrent CNN (RCNN) for object recognition by incorporating recurrent connections into each convolutional layer.
$endgroup$
add a comment |
$begingroup$
Actually, I do not think it should be a good way of using RNN only to do object detection work, because there is no "Receptive Field" conception in RNN compared with CNN, which I think should be a key point in doing vision related task.
$endgroup$
add a comment |
$begingroup$
Recurrent Neural Networks (RNN) are the state of the art algorithm for sequential data and Long Short-Term Memory (LSTM) networks are an extension for RNN. This method can be used on object detection in case detect object in video or moving images, etc. You can try this https://github.com/tensorflow/models/tree/master/research/lstm_object_detection. It implementation from Tensorflow mobile video object detection implementation proposed in the following paper: Mobile Video Object Detection with Temporally-Aware Feature Maps (CVPR 2018). The link of paper: http://openaccess.thecvf.com/content_cvpr_2018/papers/Liu_Mobile_Video_Object_CVPR_2018_paper.pdf
New contributor
$endgroup$
add a comment |
Your Answer
StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "557"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f28095%2fis-there-any-rnn-method-used-for-object-detection%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
3 Answers
3
active
oldest
votes
3 Answers
3
active
oldest
votes
active
oldest
votes
active
oldest
votes
$begingroup$
Yes, there have been many attempts, but perhaps the most noteable one is the approach described in the paper of Andrej Karpathy and Li Fei-Fei where they connect a CNN and RNN in series (CNN over image region + bidirectional RNN + Multimodal RNN) and use this for labeling a scene with a whole sentence. Though, this one is more than just object detection as it leverages a data set of scenes and their descriptions to generate natural language descriptions of new unseen images.
Another example is Ming Liang and Xiaolin Hu's approche where they mix a CNN with an RNN and use this architecture for better object detection. As Ming and Xiaolin explained in their paper (linked above), the RNN is used to improve the CNN:
A prominent difference is that CNN is typically a feed-forward architecture while in the visual system recurrent connections are abundant. Inspired by this fact, we propose a recurrent CNN (RCNN) for object recognition by incorporating recurrent connections into each convolutional layer.
$endgroup$
add a comment |
$begingroup$
Yes, there have been many attempts, but perhaps the most noteable one is the approach described in the paper of Andrej Karpathy and Li Fei-Fei where they connect a CNN and RNN in series (CNN over image region + bidirectional RNN + Multimodal RNN) and use this for labeling a scene with a whole sentence. Though, this one is more than just object detection as it leverages a data set of scenes and their descriptions to generate natural language descriptions of new unseen images.
Another example is Ming Liang and Xiaolin Hu's approche where they mix a CNN with an RNN and use this architecture for better object detection. As Ming and Xiaolin explained in their paper (linked above), the RNN is used to improve the CNN:
A prominent difference is that CNN is typically a feed-forward architecture while in the visual system recurrent connections are abundant. Inspired by this fact, we propose a recurrent CNN (RCNN) for object recognition by incorporating recurrent connections into each convolutional layer.
$endgroup$
add a comment |
$begingroup$
Yes, there have been many attempts, but perhaps the most noteable one is the approach described in the paper of Andrej Karpathy and Li Fei-Fei where they connect a CNN and RNN in series (CNN over image region + bidirectional RNN + Multimodal RNN) and use this for labeling a scene with a whole sentence. Though, this one is more than just object detection as it leverages a data set of scenes and their descriptions to generate natural language descriptions of new unseen images.
Another example is Ming Liang and Xiaolin Hu's approche where they mix a CNN with an RNN and use this architecture for better object detection. As Ming and Xiaolin explained in their paper (linked above), the RNN is used to improve the CNN:
A prominent difference is that CNN is typically a feed-forward architecture while in the visual system recurrent connections are abundant. Inspired by this fact, we propose a recurrent CNN (RCNN) for object recognition by incorporating recurrent connections into each convolutional layer.
$endgroup$
Yes, there have been many attempts, but perhaps the most noteable one is the approach described in the paper of Andrej Karpathy and Li Fei-Fei where they connect a CNN and RNN in series (CNN over image region + bidirectional RNN + Multimodal RNN) and use this for labeling a scene with a whole sentence. Though, this one is more than just object detection as it leverages a data set of scenes and their descriptions to generate natural language descriptions of new unseen images.
Another example is Ming Liang and Xiaolin Hu's approche where they mix a CNN with an RNN and use this architecture for better object detection. As Ming and Xiaolin explained in their paper (linked above), the RNN is used to improve the CNN:
A prominent difference is that CNN is typically a feed-forward architecture while in the visual system recurrent connections are abundant. Inspired by this fact, we propose a recurrent CNN (RCNN) for object recognition by incorporating recurrent connections into each convolutional layer.
edited Jun 15 '18 at 0:30
Kyle.
1,1901829
1,1901829
answered Feb 21 '18 at 15:31
Achraf OussidiAchraf Oussidi
894
894
add a comment |
add a comment |
$begingroup$
Actually, I do not think it should be a good way of using RNN only to do object detection work, because there is no "Receptive Field" conception in RNN compared with CNN, which I think should be a key point in doing vision related task.
$endgroup$
add a comment |
$begingroup$
Actually, I do not think it should be a good way of using RNN only to do object detection work, because there is no "Receptive Field" conception in RNN compared with CNN, which I think should be a key point in doing vision related task.
$endgroup$
add a comment |
$begingroup$
Actually, I do not think it should be a good way of using RNN only to do object detection work, because there is no "Receptive Field" conception in RNN compared with CNN, which I think should be a key point in doing vision related task.
$endgroup$
Actually, I do not think it should be a good way of using RNN only to do object detection work, because there is no "Receptive Field" conception in RNN compared with CNN, which I think should be a key point in doing vision related task.
answered Jun 14 '18 at 21:54
ZQ HuZQ Hu
111
111
add a comment |
add a comment |
$begingroup$
Recurrent Neural Networks (RNN) are the state of the art algorithm for sequential data and Long Short-Term Memory (LSTM) networks are an extension for RNN. This method can be used on object detection in case detect object in video or moving images, etc. You can try this https://github.com/tensorflow/models/tree/master/research/lstm_object_detection. It implementation from Tensorflow mobile video object detection implementation proposed in the following paper: Mobile Video Object Detection with Temporally-Aware Feature Maps (CVPR 2018). The link of paper: http://openaccess.thecvf.com/content_cvpr_2018/papers/Liu_Mobile_Video_Object_CVPR_2018_paper.pdf
New contributor
$endgroup$
add a comment |
$begingroup$
Recurrent Neural Networks (RNN) are the state of the art algorithm for sequential data and Long Short-Term Memory (LSTM) networks are an extension for RNN. This method can be used on object detection in case detect object in video or moving images, etc. You can try this https://github.com/tensorflow/models/tree/master/research/lstm_object_detection. It implementation from Tensorflow mobile video object detection implementation proposed in the following paper: Mobile Video Object Detection with Temporally-Aware Feature Maps (CVPR 2018). The link of paper: http://openaccess.thecvf.com/content_cvpr_2018/papers/Liu_Mobile_Video_Object_CVPR_2018_paper.pdf
New contributor
$endgroup$
add a comment |
$begingroup$
Recurrent Neural Networks (RNN) are the state of the art algorithm for sequential data and Long Short-Term Memory (LSTM) networks are an extension for RNN. This method can be used on object detection in case detect object in video or moving images, etc. You can try this https://github.com/tensorflow/models/tree/master/research/lstm_object_detection. It implementation from Tensorflow mobile video object detection implementation proposed in the following paper: Mobile Video Object Detection with Temporally-Aware Feature Maps (CVPR 2018). The link of paper: http://openaccess.thecvf.com/content_cvpr_2018/papers/Liu_Mobile_Video_Object_CVPR_2018_paper.pdf
New contributor
$endgroup$
Recurrent Neural Networks (RNN) are the state of the art algorithm for sequential data and Long Short-Term Memory (LSTM) networks are an extension for RNN. This method can be used on object detection in case detect object in video or moving images, etc. You can try this https://github.com/tensorflow/models/tree/master/research/lstm_object_detection. It implementation from Tensorflow mobile video object detection implementation proposed in the following paper: Mobile Video Object Detection with Temporally-Aware Feature Maps (CVPR 2018). The link of paper: http://openaccess.thecvf.com/content_cvpr_2018/papers/Liu_Mobile_Video_Object_CVPR_2018_paper.pdf
New contributor
New contributor
answered 16 mins ago
user8504513user8504513
1
1
New contributor
New contributor
add a comment |
add a comment |
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f28095%2fis-there-any-rnn-method-used-for-object-detection%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
$begingroup$
Recurrent YOLO for object tracking - Project page - paper: enter link description here - Tensorflow implementation
$endgroup$
– Alireza Akhavan
Oct 10 '18 at 5:37