How do I calculate the weight values for a piece-wise linear ReLU network approximating function?
$begingroup$
I'm currently studying ReLU neural networks and their ability to approximate functions. Specifically, I'm looking at using ReLU networks to approximate a function using piece-wise linear components. I've been referencing this paper and specifically the excerpt in the snippet below.
Let's say I have a known function, how would I go about actually generating a ReLU network that approximates it? That is, I want a ReLU neural network that outputs a linear piece-wise function. For example, looking at the right plot on the image below:
My issue here is that even when I know the function, it's very difficult to to determine the weights to turn on the ReLU with the correct slopes at the correct time.
I can cause delays using the ReLUs by modifying the b parameter (per the equation), however, it seems like the current slope must be calculated such that all the previous ReLU slopes are negated.. this is due to the fact that once a ReLU turns on for some X input, it doesn't turn off as X increases.
So my question: Assuming I know a function perfectly (can calculate slopes, etc.) and I want to approximate it piece-wise linear using a ReLU network, how do I actually determine the weight values (i.e. slopes of each ReLU)? This appears to be a non-trivial task.
neural-network deep-learning
New contributor
$endgroup$
add a comment |
$begingroup$
I'm currently studying ReLU neural networks and their ability to approximate functions. Specifically, I'm looking at using ReLU networks to approximate a function using piece-wise linear components. I've been referencing this paper and specifically the excerpt in the snippet below.
Let's say I have a known function, how would I go about actually generating a ReLU network that approximates it? That is, I want a ReLU neural network that outputs a linear piece-wise function. For example, looking at the right plot on the image below:
My issue here is that even when I know the function, it's very difficult to to determine the weights to turn on the ReLU with the correct slopes at the correct time.
I can cause delays using the ReLUs by modifying the b parameter (per the equation), however, it seems like the current slope must be calculated such that all the previous ReLU slopes are negated.. this is due to the fact that once a ReLU turns on for some X input, it doesn't turn off as X increases.
So my question: Assuming I know a function perfectly (can calculate slopes, etc.) and I want to approximate it piece-wise linear using a ReLU network, how do I actually determine the weight values (i.e. slopes of each ReLU)? This appears to be a non-trivial task.
neural-network deep-learning
New contributor
$endgroup$
add a comment |
$begingroup$
I'm currently studying ReLU neural networks and their ability to approximate functions. Specifically, I'm looking at using ReLU networks to approximate a function using piece-wise linear components. I've been referencing this paper and specifically the excerpt in the snippet below.
Let's say I have a known function, how would I go about actually generating a ReLU network that approximates it? That is, I want a ReLU neural network that outputs a linear piece-wise function. For example, looking at the right plot on the image below:
My issue here is that even when I know the function, it's very difficult to to determine the weights to turn on the ReLU with the correct slopes at the correct time.
I can cause delays using the ReLUs by modifying the b parameter (per the equation), however, it seems like the current slope must be calculated such that all the previous ReLU slopes are negated.. this is due to the fact that once a ReLU turns on for some X input, it doesn't turn off as X increases.
So my question: Assuming I know a function perfectly (can calculate slopes, etc.) and I want to approximate it piece-wise linear using a ReLU network, how do I actually determine the weight values (i.e. slopes of each ReLU)? This appears to be a non-trivial task.
neural-network deep-learning
New contributor
$endgroup$
I'm currently studying ReLU neural networks and their ability to approximate functions. Specifically, I'm looking at using ReLU networks to approximate a function using piece-wise linear components. I've been referencing this paper and specifically the excerpt in the snippet below.
Let's say I have a known function, how would I go about actually generating a ReLU network that approximates it? That is, I want a ReLU neural network that outputs a linear piece-wise function. For example, looking at the right plot on the image below:
My issue here is that even when I know the function, it's very difficult to to determine the weights to turn on the ReLU with the correct slopes at the correct time.
I can cause delays using the ReLUs by modifying the b parameter (per the equation), however, it seems like the current slope must be calculated such that all the previous ReLU slopes are negated.. this is due to the fact that once a ReLU turns on for some X input, it doesn't turn off as X increases.
So my question: Assuming I know a function perfectly (can calculate slopes, etc.) and I want to approximate it piece-wise linear using a ReLU network, how do I actually determine the weight values (i.e. slopes of each ReLU)? This appears to be a non-trivial task.
neural-network deep-learning
neural-network deep-learning
New contributor
New contributor
New contributor
asked 6 mins ago
IzzoIzzo
1011
1011
New contributor
New contributor
add a comment |
add a comment |
0
active
oldest
votes
Your Answer
StackExchange.ifUsing("editor", function () {
return StackExchange.using("mathjaxEditing", function () {
StackExchange.MarkdownEditor.creationCallbacks.add(function (editor, postfix) {
StackExchange.mathjaxEditing.prepareWmdForMathJax(editor, postfix, [["$", "$"], ["\\(","\\)"]]);
});
});
}, "mathjax-editing");
StackExchange.ready(function() {
var channelOptions = {
tags: "".split(" "),
id: "557"
};
initTagRenderer("".split(" "), "".split(" "), channelOptions);
StackExchange.using("externalEditor", function() {
// Have to fire editor after snippets, if snippets enabled
if (StackExchange.settings.snippets.snippetsEnabled) {
StackExchange.using("snippets", function() {
createEditor();
});
}
else {
createEditor();
}
});
function createEditor() {
StackExchange.prepareEditor({
heartbeatType: 'answer',
autoActivateHeartbeat: false,
convertImagesToLinks: false,
noModals: true,
showLowRepImageUploadWarning: true,
reputationToPostImages: null,
bindNavPrevention: true,
postfix: "",
imageUploader: {
brandingHtml: "Powered by u003ca class="icon-imgur-white" href="https://imgur.com/"u003eu003c/au003e",
contentPolicyHtml: "User contributions licensed under u003ca href="https://creativecommons.org/licenses/by-sa/3.0/"u003ecc by-sa 3.0 with attribution requiredu003c/au003e u003ca href="https://stackoverflow.com/legal/content-policy"u003e(content policy)u003c/au003e",
allowUrls: true
},
onDemand: true,
discardSelector: ".discard-answer"
,immediatelyShowMarkdownHelp:true
});
}
});
Izzo is a new contributor. Be nice, and check out our Code of Conduct.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47204%2fhow-do-i-calculate-the-weight-values-for-a-piece-wise-linear-relu-network-approx%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
0
active
oldest
votes
0
active
oldest
votes
active
oldest
votes
active
oldest
votes
Izzo is a new contributor. Be nice, and check out our Code of Conduct.
Izzo is a new contributor. Be nice, and check out our Code of Conduct.
Izzo is a new contributor. Be nice, and check out our Code of Conduct.
Izzo is a new contributor. Be nice, and check out our Code of Conduct.
Thanks for contributing an answer to Data Science Stack Exchange!
- Please be sure to answer the question. Provide details and share your research!
But avoid …
- Asking for help, clarification, or responding to other answers.
- Making statements based on opinion; back them up with references or personal experience.
Use MathJax to format equations. MathJax reference.
To learn more, see our tips on writing great answers.
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
StackExchange.ready(
function () {
StackExchange.openid.initPostLogin('.new-post-login', 'https%3a%2f%2fdatascience.stackexchange.com%2fquestions%2f47204%2fhow-do-i-calculate-the-weight-values-for-a-piece-wise-linear-relu-network-approx%23new-answer', 'question_page');
}
);
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Sign up or log in
StackExchange.ready(function () {
StackExchange.helpers.onClickDraftSave('#login-link');
});
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Sign up using Google
Sign up using Facebook
Sign up using Email and Password
Post as a guest
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown
Required, but never shown