[awk] split a file into multiple files Solved

Question

Hello everyone,

I'm going around in circles on a script that must be written 100% in awk, and I'm starting to wonder whether it's actually possible. I tried it with ChatGPT, but it gave me nonsense :-(

Here’s my problem:

I want to start a new output file each time the word PAGE appears in the input file. The line containing PAGE becomes the first line of the newly created file.
If the word NIR appears somewhere after that PAGE, the word that follows NIR is stored in the NIR variable and used to name the file: the file name is the value of the NIR variable followed by the extension ".txt".

Example:

Input file:

  totot  titi PAGE  tata  tata dfdf fdf NIR un deux troix quatre tata dfdfd dfdfdf dfdf PAGE dfd fdfdfd dfdf dfddfdfdf NIR one two three four five dfdf df PAGE dfdf NIR dfdf dfdfd

Expected result:

The first file created is named un.txt and contains:

  titi PAGE  tata  tata dfdf fdf NIR un deux troix quatre tata dfdfd

The second file created is named one.txt and contains:

 dfdfdf dfdf PAGE dfd fdfdfd dfdf dfddfdfdf NIR one two three four five dfdf

The third file created is named dfdf.txt and contains:

 df PAGE dfdf NIR dfdf dfdfd

Thank you in advance

dubcek · Answer

hello

try

  $ awk 'BEGIN {while(getline < "file")if($0 ~ /NIR/)for(n=1; n<=NF; n++)if($n ~ /NIR/)t[++f]=$(n+1)}
         /PAGE/ {a=t[++x] ".txt"}
         a {print $0 > a }' file
  $
  $ more *txt
  ::::::::::::::
  dfdf.txt
  ::::::::::::::
  df PAGE dfdf NIR dfdf dfdfd
  ::::::::::::::
  one.txt
  ::::::::::::::
  dfdfdf dfdf PAGE dfd fdfdfd dfdf dfddfdfdf NIR one two three four five dfdf
  ::::::::::::::
  un.txt
  ::::::::::::::
  titi PAGE tata tata dfdf fdf NIR un deux trois quatre tata dfdfd
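The same two-pass idea can also be sketched with the common NR == FNR idiom, where the input file is named twice on the command line instead of being read with getline in BEGIN. This is only an illustrative variant, not dubcek's exact command: the file name sample.txt, its line layout, and the variable names names, found, page and out are assumptions made for the example. It also compares fields with == rather than a regex match, and closes each output file before opening the next.

```shell
# Hypothetical sample input; the line layout is an assumption.
cat > sample.txt <<'EOF'
totot
titi PAGE
tata
tata dfdf fdf
NIR un deux troix quatre
tata dfdfd
dfdfdf dfdf PAGE
dfd fdfdfd dfdf dfddfdfdf
NIR one two three four five
dfdf
df PAGE
dfdf
NIR dfdf dfdfd
EOF

# The file is named twice: awk reads it once to collect names, once to write.
awk '
    # Pass 1 (NR == FNR holds only while the first copy is being read):
    # store the word following each NIR, in order of appearance.
    NR == FNR {
        for (n = 1; n < NF; n++)
            if ($n == "NIR") names[++found] = $(n + 1)
        next
    }
    # Pass 2: each PAGE line switches the output to the next stored name.
    /PAGE/ {
        if (out != "") close(out)
        out = names[++page] ".txt"
    }
    # Lines before the first PAGE are skipped because out is still empty.
    out != "" { print > out }
' sample.txt sample.txt
```

With this layout, the run should leave un.txt, one.txt and dfdf.txt in the current directory, each starting with its PAGE line. A page with no NIR after it would end up in a file named just ".txt", which is the gap raised in the next answer.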

mamiemando · Answer

Hello,

I rewrote the initial message because some phrasing was ambiguous, and I’m not surprised that ChatGPT stumbled (I don’t actually believe ChatGPT can properly solve a non-trivial programming exercise, but anyway).

That said, the problem definition is incomplete:

What happens (and should happen) if the keyword NIR does not appear after the word PAGE?
- Do you still create a new file?
- If so, what do you name it?
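One possible answer to these questions, sketched under stated assumptions: the page is still written, and it gets a fallback name built from its position (page2.txt for the second page). The fallback naming, the file sample2.txt, and the names write_page, buf, count and inpage are all hypothetical choices for this example; the original poster did not specify the desired behavior. The sketch works in a single pass by buffering each page until the next PAGE line or the end of the input.

```shell
# Hypothetical input in which the second page has no NIR.
cat > sample2.txt <<'EOF'
intro
a PAGE
NIR un deux
b
c PAGE
no name here
d PAGE
NIR trois
EOF

awk '
    # Write the buffered page to its file, then reset the buffer.
    function write_page(    i, out) {
        if (count == 0) return
        # Fallback name "page<N>" is an assumption, used when no NIR was seen.
        out = (name != "" ? name : "page" page) ".txt"
        for (i = 1; i <= count; i++) print buf[i] > out
        close(out)
        count = 0
        name = ""
    }
    # A PAGE line ends the previous page and starts a new one.
    /PAGE/ { write_page(); page++; inpage = 1 }
    # Lines before the first PAGE are ignored because inpage is still 0.
    inpage {
        buf[++count] = $0
        # Remember the first word that follows NIR on this page.
        if (name == "")
            for (n = 1; n < NF; n++)
                if ($n == "NIR") { name = $(n + 1); break }
    }
    END { write_page() }
' sample2.txt
```

Here the first page goes to un.txt, the nameless second page to page2.txt, and the third to trois.txt. If two pages carried the same NIR value, the later one would overwrite the earlier file, so a real script would need a rule for that case as well.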

Good luck

DIE · Answer

It works SENSATIONALLY WELL :) Thank you so much. I don't understand how t[++f] works compared to t[++x]; could you enlighten me?
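As a hedged illustration of the two counters: f and x are independent. In the BEGIN block, f is incremented each time a NIR is found, so t[1], t[2], ... hold the names in order of appearance. In the main pass, x is incremented each time a PAGE line is read, so the x-th PAGE looks up the x-th stored name. The small trace below prints that pairing instead of writing files; demo.txt and trace.txt are hypothetical names, and the getline loop is written with an explicit > 0 test so it stops if the file cannot be read.

```shell
# Tiny hypothetical input with two pages.
printf '%s\n' 'x PAGE' 'NIR un' 'y PAGE' 'NIR one' > demo.txt

awk '
    BEGIN {
        # First read: f counts the NIR words seen so far, so t[1] is the
        # name after the first NIR, t[2] the name after the second, etc.
        while ((getline < "demo.txt") > 0)
            for (n = 1; n < NF; n++)
                if ($n == "NIR") t[++f] = $(n + 1)
        close("demo.txt")
    }
    # Second read: x counts the PAGE lines, so the x-th PAGE picks up t[x].
    /PAGE/ { ++x; print "PAGE " x " uses t[" x "] = " t[x] }
' demo.txt > trace.txt

cat trace.txt
```

The trace shows "PAGE 1 uses t[1] = un" and "PAGE 2 uses t[2] = one". The pairing relies on every PAGE having exactly one NIR after it; with a missing or extra NIR, the two counters drift apart and pages get the wrong names.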

DIE · Answer

Thank you both. It's very clear when I read your explanations, but very confusing to come up with it all by myself ;) Definitely a lack of logic on my part.
