You can correct it with the following Cygwin/Unix commands. First, locate the malformed line:
grep -n "0,tcp,private,S0,0,0,0,0,0,0,0,0,0,0,00,tcp,http,SF" KDDCup99_full.arff
This gives the following output, where a truncated record runs straight into the next one on line 4817251:
4817251:0,tcp,private,S0,0,0,0,0,0,0,0,0,0,0,00,tcp,http,SF,334,1684,0,0,0,0,0,1,0,0,0,0,0,0,0,0,0,0,1,9,0.00,0.00,0.00,0.00,1.00,0.00,0.33,0,0,0.00,0.00,0.00,0.00,0.00,0.00,0.00,0.00,normal
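If you don't know the broken pattern in advance, a field-count check is one way to look for such lines. The following is only a sketch: `check_fields` is a hypothetical helper name, and it assumes each data row should have a fixed number of comma-separated fields (for KDD Cup 99 that would be 42, i.e. 41 attributes plus the class label, if I have the format right). It is demonstrated on a small stand-in file rather than the real dataset.

```shell
# Hypothetical helper: check_fields EXPECTED FILE
# Prints the line number and field count of every data row whose
# comma-separated field count differs from EXPECTED.
check_fields() {
    # Skip ARFF header lines (@...) and comments (%...), and empty lines.
    awk -F, -v n="$1" '!/^[@%]/ && NF > 0 && NF != n { print NR ": " NF " fields" }' "$2"
}

# Small stand-in file: the third line is two 3-field records fused together.
printf '@data\n1,2,3\n1,2,31,2,3\n4,5,6\n' > sample.arff
check_fields 3 sample.arff
```

On the real file the call would presumably be `check_fields 42 KDDCup99_full.arff`, again assuming 42 is the right field count.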
Then drop line 4817251 by copying everything before and after it into a new file:
sed -n 1,4817250p KDDCup99_full.arff > full1.arff
sed -n 4817252,4898582p KDDCup99_full.arff > full2.arff
cat full1.arff full2.arff > KDDCup99_fullCorrected.arff
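The same result should also be reachable in one step with sed's `d` (delete) command, without the intermediate files. This is a minimal sketch: `delete_line` is a hypothetical helper name, and the example runs on a small stand-in file so it can be tried without the full dataset.

```shell
# Hypothetical helper: delete_line N INPUT OUTPUT
# Copies INPUT to OUTPUT, leaving out line number N.
delete_line() {
    # "Nd" tells sed to delete line N; every other line is printed unchanged.
    sed "${1}d" "$2" > "$3"
}

# Small stand-in file with one bad line (line 3).
printf 'a\nb\nBAD\nc\n' > sample.txt
delete_line 3 sample.txt sample_fixed.txt
cat sample_fixed.txt

# For the file discussed here, the call would be:
#   delete_line 4817251 KDDCup99_full.arff KDDCup99_fullCorrected.arff
```

Like the sed/cat approach, this discards the whole malformed line, so both of the fused records are lost.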