Mining Suspicious Tax Evasion Groups in Big Data

F. Tian, T. Lan, Kuo-Ming Chao, N. Godwin, Q. Zheng, Nazaraf Shah, F. Zhang

Research output: Contribution to journalArticle

12 Citations (Scopus)

Abstract

There is evidence that an increasing number of enterprises plot together to evade tax in an unperceived way. At the same time, the taxation information related data is a classic kind of big data. These issues challenge the effectiveness of traditional data mining-based tax evasion detection methods. To address this problem, we first investigate the classic tax evasion cases, and employ a graph-based method to characterize their property that describes two suspicious relationship trails with a same antecedent node behind an Interest-Affiliated Transaction (IAT). Next, we propose a Colored Network-Based Model (CNBM) for characterizing economic behaviors, social relationships, and the IATs between taxpayers, and generating a Taxpayer Interest Interacted Network (TPIIN). To accomplish the tax evasion detection task by discovering suspicious groups in a TPIIN, methods for building a patterns tree and matching component patterns are introduced and the completeness of the methods based on graph theory is presented. Then, we describe an experiment based on real data and a simulated network. The experimental results show that our proposed method greatly improves the efficiency of tax evasion detection, as well as provides a clear explanation of the tax evasion behaviors of taxpayer groups.
Original languageEnglish
Pages (from-to)2651 - 2664
JournalIEEE Transactions on Knowledge and Data Engineering
Volume28
Issue number10
Early online date8 Jun 2016
DOIs
Publication statusPublished - 1 Oct 2016

Fingerprint

Taxation
Graph theory
Big data
Data mining
Economics
Industry
Experiments

Bibliographical note

This paper is not yet available on the repository

Keywords

  • big data
  • Graph mining
  • tax evasion
  • interest-affiliated transaction
  • heterogeneous information network

Cite this

Mining Suspicious Tax Evasion Groups in Big Data. / Tian, F.; Lan, T.; Chao, Kuo-Ming; Godwin, N.; Zheng, Q.; Shah, Nazaraf; Zhang, F.

In: IEEE Transactions on Knowledge and Data Engineering, Vol. 28, No. 10, 01.10.2016, p. 2651 - 2664.

Research output: Contribution to journalArticle

Tian, F. ; Lan, T. ; Chao, Kuo-Ming ; Godwin, N. ; Zheng, Q. ; Shah, Nazaraf ; Zhang, F. / Mining Suspicious Tax Evasion Groups in Big Data. In: IEEE Transactions on Knowledge and Data Engineering. 2016 ; Vol. 28, No. 10. pp. 2651 - 2664.
@article{2d81cf01f0194930a8f3380dfe3698fd,
title = "Mining Suspicious Tax Evasion Groups in Big Data",
abstract = "There is evidence that an increasing number of enterprises plot together to evade tax in an unperceived way. At the same time, the taxation information related data is a classic kind of big data. These issues challenge the effectiveness of traditional data mining-based tax evasion detection methods. To address this problem, we first investigate the classic tax evasion cases, and employ a graph-based method to characterize their property that describes two suspicious relationship trails with a same antecedent node behind an Interest-Affiliated Transaction (IAT). Next, we propose a Colored Network-Based Model (CNBM) for characterizing economic behaviors, social relationships, and the IATs between taxpayers, and generating a Taxpayer Interest Interacted Network (TPIIN). To accomplish the tax evasion detection task by discovering suspicious groups in a TPIIN, methods for building a patterns tree and matching component patterns are introduced and the completeness of the methods based on graph theory is presented. Then, we describe an experiment based on real data and a simulated network. The experimental results show that our proposed method greatly improves the efficiency of tax evasion detection, as well as provides a clear explanation of the tax evasion behaviors of taxpayer groups.",
keywords = "big data, Graph mining, tax evasion, interest-affiliated transaction, heterogeneous information network",
author = "F. Tian and T. Lan and Kuo-Ming Chao and N. Godwin and Q. Zheng and Nazaraf Shah and F. Zhang",
note = "This paper is not yet available on the repository",
year = "2016",
month = "10",
day = "1",
doi = "10.1109/TKDE.2016.2571686",
language = "English",
volume = "28",
pages = "2651 -- 2664",
journal = "IEEE Transactions on Knowledge and Data Engineering",
issn = "1041-4347",
publisher = "Institute of Electrical and Electronics Engineers",
number = "10",

}

TY - JOUR

T1 - Mining Suspicious Tax Evasion Groups in Big Data

AU - Tian, F.

AU - Lan, T.

AU - Chao, Kuo-Ming

AU - Godwin, N.

AU - Zheng, Q.

AU - Shah, Nazaraf

AU - Zhang, F.

N1 - This paper is not yet available on the repository

PY - 2016/10/1

Y1 - 2016/10/1

N2 - There is evidence that an increasing number of enterprises plot together to evade tax in an unperceived way. At the same time, the taxation information related data is a classic kind of big data. These issues challenge the effectiveness of traditional data mining-based tax evasion detection methods. To address this problem, we first investigate the classic tax evasion cases, and employ a graph-based method to characterize their property that describes two suspicious relationship trails with a same antecedent node behind an Interest-Affiliated Transaction (IAT). Next, we propose a Colored Network-Based Model (CNBM) for characterizing economic behaviors, social relationships, and the IATs between taxpayers, and generating a Taxpayer Interest Interacted Network (TPIIN). To accomplish the tax evasion detection task by discovering suspicious groups in a TPIIN, methods for building a patterns tree and matching component patterns are introduced and the completeness of the methods based on graph theory is presented. Then, we describe an experiment based on real data and a simulated network. The experimental results show that our proposed method greatly improves the efficiency of tax evasion detection, as well as provides a clear explanation of the tax evasion behaviors of taxpayer groups.

AB - There is evidence that an increasing number of enterprises plot together to evade tax in an unperceived way. At the same time, the taxation information related data is a classic kind of big data. These issues challenge the effectiveness of traditional data mining-based tax evasion detection methods. To address this problem, we first investigate the classic tax evasion cases, and employ a graph-based method to characterize their property that describes two suspicious relationship trails with a same antecedent node behind an Interest-Affiliated Transaction (IAT). Next, we propose a Colored Network-Based Model (CNBM) for characterizing economic behaviors, social relationships, and the IATs between taxpayers, and generating a Taxpayer Interest Interacted Network (TPIIN). To accomplish the tax evasion detection task by discovering suspicious groups in a TPIIN, methods for building a patterns tree and matching component patterns are introduced and the completeness of the methods based on graph theory is presented. Then, we describe an experiment based on real data and a simulated network. The experimental results show that our proposed method greatly improves the efficiency of tax evasion detection, as well as provides a clear explanation of the tax evasion behaviors of taxpayer groups.

KW - big data

KW - Graph mining

KW - tax evasion

KW - interest-affiliated transaction

KW - heterogeneous information network

U2 - 10.1109/TKDE.2016.2571686

DO - 10.1109/TKDE.2016.2571686

M3 - Article

VL - 28

SP - 2651

EP - 2664

JO - IEEE Transactions on Knowledge and Data Engineering

JF - IEEE Transactions on Knowledge and Data Engineering

SN - 1041-4347

IS - 10

ER -