Comparison of Sampling Strategies for Flow Length Estimation

Demir M. O., Kurt B., Buyukcorak S., Karabulut Kurt G. Z., CEMGİL A. T., Zeydan E.

22nd IEEE Signal Processing and Communications Applications Conference (SIU), Trabzon, Turkey, 23 - 25 April 2014, pp.2182-2185

  • Publication Type: Conference Paper / Full Text
  • City: Trabzon
  • Country: Turkey
  • Page Numbers: pp.2182-2185
  • Istanbul Technical University Affiliated: Yes


The analysis and understanding of network traffic has become increasingly important because of the insight it provides into user behaviour and resource usage. Because of data privacy concerns and the heavy computational and memory load involved, the analyses that determine the relevant parameters are performed on a small subset of the complete flow data. That is, sampling is required in order to infer the properties of the complete data set. In this work, four distinct sampling schemes are investigated: packet-based uniform sampling, time-slot-based uniform sampling, packet-based random sampling and time-slot-based random sampling. Packet flow length distributions are estimated from the samples produced by each scheme and compared with those of the actual data. No major differences are observed among the strategies on the analysed data.
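
The four schemes named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes that each packet is a (timestamp, flow_id) tuple, that "uniform" means keeping every k-th packet or time slot, and that "random" means keeping each packet or time slot independently with probability 1/k. The function names, the sampling period k and the slot length slot_len are hypothetical, and the paper's exact definitions may differ.

```python
import random
from collections import Counter


def packet_uniform(packets, k):
    """Packet-based uniform sampling: keep every k-th packet by index."""
    return packets[::k]


def packet_random(packets, k, rng):
    """Packet-based random sampling: keep each packet with probability 1/k."""
    return [p for p in packets if rng.random() < 1.0 / k]


def slot_uniform(packets, k, slot_len):
    """Time-slot-based uniform sampling: keep all packets in every k-th slot."""
    return [p for p in packets if int(p[0] // slot_len) % k == 0]


def slot_random(packets, k, slot_len, rng):
    """Time-slot-based random sampling: keep each slot with probability 1/k."""
    if not packets:
        return []
    n_slots = int(max(t for t, _ in packets) // slot_len) + 1
    kept = {s for s in range(n_slots) if rng.random() < 1.0 / k}
    return [p for p in packets if int(p[0] // slot_len) in kept]


def flow_length_distribution(packets):
    """Map each flow length (packets per flow) to the number of such flows."""
    per_flow = Counter(flow for _, flow in packets)
    return Counter(per_flow.values())


if __name__ == "__main__":
    # Synthetic trace (assumption): 5000 packets over 100 s, 200 flow ids.
    rng = random.Random(0)
    trace = sorted((rng.uniform(0.0, 100.0), rng.randrange(200))
                   for _ in range(5000))
    k, slot_len = 10, 0.5
    samples = {
        "packet uniform": packet_uniform(trace, k),
        "packet random": packet_random(trace, k, rng),
        "slot uniform": slot_uniform(trace, k, slot_len),
        "slot random": slot_random(trace, k, slot_len, rng),
    }
    print("actual:", sorted(flow_length_distribution(trace).items())[:5])
    for name, sample in samples.items():
        dist = flow_length_distribution(sample)
        print(name, len(sample), sorted(dist.items())[:5])
```

The distributions computed here are the raw flow lengths seen in each sample. Comparing them with the actual distribution would additionally require an estimator that accounts for the sampling rate, which this sketch leaves out.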