How to remove the last two "-delimited strings from each line in a large file

I have numerous 2GB space-delimited files from a source system. Each row in each file contains the same number of strings surrounded by " as text qualifiers.

I need to eliminate the last two strings and their text qualifiers from every row in each file (like removing the last two columns from a columnar report). With smaller files, I can import into Excel, split on the delimiter, delete the columns, and save as tab-delimited (much more useful than spaces).

In any case, these files are too large and have too many rows for Excel. So sed?

"text1" "text2" "text3" "text4" "text5" "text6"

Every row has the same number of strings. How do I drop "text5" "text6" from every row?

edited May 18 '17 at 1:30

Stephen Rauch

asked May 18 '17 at 1:19

user231894

awk '{$5=$6=""}1' file...

– jasonwryan
May 18 '17 at 1:41

@jasonwryan: Or just awk 'NF=4'

– Thor
May 18 '17 at 5:04

@Thor better...

– jasonwryan
May 18 '17 at 5:07

text-processing sed text delete


4 Answers

This sed command removes the last two space-separated, quoted strings from the end of each line of the file infile and writes the result to outfile:

sed 's/ *"[^"]*" *"[^"]*" *$//' < infile > outfile
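A quick way to check the substitution before running it on a 2GB file is to try it on a tiny sample. The sketch below assumes a throwaway file named sample.txt in a temporary directory (both names are made up for illustration). The second sample line has spaces inside the quotes, a case the question does not mention, to show that the pattern works on quoted strings rather than on whitespace-separated words.

```shell
# Hypothetical scratch area and file name, used only for this check.
tmpdir=$(mktemp -d)
printf '%s\n' \
  '"text1" "text2" "text3" "text4" "text5" "text6"' \
  '"a b" "c" "d" "e" "f g" "h"' > "$tmpdir/sample.txt"

# Same substitution as in the answer: drop the last two quoted strings
# together with the spaces around them.
sed 's/ *"[^"]*" *"[^"]*" *$//' < "$tmpdir/sample.txt" > "$tmpdir/sample.out"

cat "$tmpdir/sample.out"
```

This should print "text1" "text2" "text3" "text4" followed by "a b" "c" "d" "e".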

answered May 18 '17 at 1:38

Stephen Rauch


If you know that you always want to delete the last two columns, this idiom can be used:

awk 'NF-=2' file

I noticed that this does not work with nawk, not sure why. The portable way is to force the field splitting with $1=$1:

awk '{NF-=2} $1=$1' file

Output:

"text1" "text2" "text3" "text4"
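Since the question mentions that tab-delimited output would be more useful than spaces, the same idea can probably be combined with OFS. This is only a sketch: the function name drop_last_two_tsv is made up here, and it assumes no quoted string contains whitespace, because awk splits on whitespace and knows nothing about the quotes.

```shell
# Hypothetical helper: drop the last two fields and join the rest with tabs.
# Assumes fields never contain embedded spaces or tabs.
drop_last_two_tsv() {
  # NF -= 2 truncates the record; $1 = $1 forces awk to rebuild $0 using OFS.
  awk -v OFS='\t' '{ NF -= 2; $1 = $1; print }' "$@"
}

printf '%s\n' '"text1" "text2" "text3" "text4" "text5" "text6"' | drop_last_two_tsv
```

Using an explicit print in the action, rather than $1=$1 as a pattern, means a line is still printed when its first field happens to be empty or 0.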

edited May 18 '17 at 5:18

answered May 18 '17 at 5:08

Thor


awk '{$(NF-1)=$NF=""}1'  inp



perl -pale '$_ = "@F[0..@F-3]"' inp



sed -ne '
   s/" "/"\
"/g
   :a
   s/\n/ /
   /\n.*\n.*\n/ba
   P
' inp

Explanation:

The awk code blanks out the last and second-to-last fields and prints the line.

In Perl, the -a option splits each line into the @F array. The slice from index 0 through the third-from-last element is selected and assigned to the current line, $_. The double quotes turn the array slice into a string, joining the elements with $" (the list separator, a space by default). The -p option then prints $_ to stdout.

In sed, we first turn every " " into "\n" (a newline between the quotes), then enter a loop that turns the newlines back into spaces, one at a time from the left, until only two are left. At that point the P command (uppercase p) prints the pattern space up to the first newline.
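To see how the awk and sed variants compare, here is a small sketch run on the sample line from the question. The variable names are made up for illustration. One difference it shows: blanking fields in awk keeps their separators, so the awk result ends in two spaces, which the extra sed at the end of that pipeline trims.

```shell
line='"text1" "text2" "text3" "text4" "text5" "text6"'

# awk variant: the last two fields become empty strings, but their
# separators remain, so trailing spaces are trimmed afterwards.
awk_out=$(printf '%s\n' "$line" | awk '{ $(NF-1) = $NF = "" } 1' | sed 's/ *$//')

# sed variant: turn each " " into a newline between the quotes, turn
# newlines back into spaces from the left until two remain, then P
# prints everything before the first remaining newline.
sed_out=$(printf '%s\n' "$line" | sed -n '
  s/" "/"\
"/g
  :a
  s/\n/ /
  /\n.*\n.*\n/ba
  P
')

printf '%s\n' "$awk_out" "$sed_out"
```

Both lines printed should read "text1" "text2" "text3" "text4".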

answered May 18 '17 at 3:53

user218374


Print every field up to field NF-2. awk provides the number of fields in each row in the variable NF.

echo '"text1" "text2" "text3" "text4" "text5" "text6"' | awk '{
for (i = 1; i <= NF-2; i++) printf "%s%s", $i, (i < NF-2 ? OFS : ORS) }'
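The explicit loop does not depend on how a particular awk handles assignments to NF, so it may be the safer choice across awk implementations. A minimal sketch of wrapping it in a reusable helper follows. The function name drop_last and the awk variable n are made up for illustration, and as with any whitespace-splitting awk approach it assumes the quoted strings contain no spaces.

```shell
# drop_last N [file...]: print each line without its last N
# whitespace-separated fields (hypothetical helper, not from the answers).
drop_last() {
  n=$1
  shift
  awk -v n="$n" '{
    last = NF - n
    # Print the fields to keep, separated by OFS, with no trailing separator.
    for (i = 1; i <= last; i++)
      printf "%s%s", $i, (i < last ? OFS : "")
    # Always end the record, even when nothing is left to print.
    printf "%s", ORS
  }' "$@"
}

printf '%s\n' '"text1" "text2" "text3" "text4" "text5" "text6"' | drop_last 2
```

For the real files this could be called as drop_last 2 infile > outfile, with infile and outfile as in the sed answer.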

edited 24 mins ago

answered 37 mins ago

Deepika Reddy Billuri



