ARGOS Project: Data Mining by Clustering

Thursday, October 1, 2015

Data Mining by Clustering

Clustering is one of the most used techniques in the Data Mining field. Its objective, in ARGOS project, is to help analyzing existing data by subdividing these data into a small number of clusters or (groups) where the events inside any specific cluster are very similar to each other and very dissimilar to the events of the other clusters.

As a result of the clustering process, several important information will be obtained among which two major ones can be quoted:

• The reduction of a large amount of events into a smaller number of events which contain almost all of the useful and relevant information

• the highlighting of regular and recurring events on the one hand which represent the frequent and “normal” events and relatively rare events on the other hand which represent infrequent and eventually “abnormal” events

Each cluster is described by the most important attributes that participated to the grouping of its events. This leads to the discovery of the profiles of the "normal" events as well as the "abnormal" ones.

The clustering method we use in Argos project is “Relational Analysis theory” [2]. This method has very powerful advantages with respect to the k-means method (the most used method for clustering), among which we can quote:

• no need to fix arbitrary the number of clusters to be found in data.

• no need to fix the clusters’ centroids (like in the k-means method)

This method has been applied in several fields (insurance, banking, video recordings, marketing, etc.). An example of its use in video recordings can be found in [1].

By Dr. Hamid Benhadda and Mikael Griffoulieres - Thales Services

References

[1] H. Benhadda, J.L. Patino, E. Corvee, F. Bremond, and M. Thonnat. “Data mining on large video recordings”. Colloque V.S.S.T.2007 : Veille Stratégique Scientifique & Technologique, 21-25 Octobre, Marrakech, 2007.

[2] Mustapha Lebbah, Younes Bennani and Hamid Benhadda. “Relational Analysis for Consensus Clustering from Multiple Partitions”. Machine Learning and Applications. ICMLA 2008: Seventh International Conference. pp 218- 223. San Diego, California, December 11-13, 2008.

18 comments:

UnknownOctober 18, 2015 at 2:04 PM
Big data is set to take the healthcare industry to the next-level of profit making. However it is imperative that healthcare institutions take a more holistic, patient-centric approach that focuses on superior health-care results and treatment expenditures. See more benefits of data mining in healthcare

ReplyDelete
Replies
TejutejuJune 11, 2018 at 3:51 PM
It was really a nice article and i was really impressed by reading this
Big data hadoop online training Bangalore
ReplyDelete
Replies
nishaJune 8, 2020 at 9:22 AM
The Blog is very creative, really appreciate those who are working behind of this article.

Data Science Training Course In Chennai | Data Science Training Course In Anna Nagar | Data Science Training Course In OMR | Data Science Training Course In Porur | Data Science Training Course In Tambaram | Data Science Training Course In Velachery
ReplyDelete
Replies
deviJuly 31, 2020 at 9:33 AM
Good Post! Thank you so much for sharing this pretty post, it was so good to read and useful to improve my knowledge as updated one, keep blogging.
Data Science Training In Chennai

Data Science Online Training In Chennai

Data Science Training In Bangalore

Data Science Training In Hyderabad

Data Science Training In Coimbatore

Data Science Training

Data Science Online Training

ReplyDelete
Replies
EXCELRSeptember 23, 2020 at 2:59 PM
Thanks a lot very much for the high quality and results-oriented help. I won’t think twice to endorse your blog post to anybody who wants and needs support about this area. data science training in Hyderabad
ReplyDelete
Replies
360digiTMG TrainingDecember 30, 2020 at 7:44 AM
I would also motivate just about every person to save this web page for any favorite assistance to assist posted the appearance.
Digital Marketing Training Institutes in Hyderabad
ReplyDelete
Replies
360digiTMG TrainingMarch 4, 2021 at 8:32 AM

Very informative post ! There is a lot of information here that can help any business get started with a successful social networking campaign !

Best Data Science Courses in Hyderabad

ReplyDelete
Replies
data scientist courseMarch 30, 2021 at 8:09 AM
I’m happy I located this blog! From time to time, students want to cognitive the keys of productive literary essays composing. Your first-class knowledge about this good post can become a proper basis for such people. nice one
data scientist certification
ReplyDelete
Replies
data scientist courseJune 29, 2021 at 11:14 AM
Wonderful blog post. This is absolute magic from you! I have never seen a more wonderful post than this one. You've really made my day today with this. I hope you keep this up!
data scientist training and placement in hyderabad
ReplyDelete
Replies
DeekshithaNovember 17, 2021 at 12:49 PM
Informative blog
cloud computing training institute in kolkata
ReplyDelete
Replies
ProCrackHere.comDecember 13, 2021 at 4:20 PM

I guess I am the only one who came here to share my very own experience. Guess what!? I am using my laptop for almost the past 2 years, but I had no idea of solving some basic issues. I do not know how to Download Cracked Pro Softwares But thankfully, I recently visited a website named procrackhere.com
Awesome Miner rack
ReplyDelete
Replies
360digitmgMay 13, 2022 at 6:57 AM
It would help if you thought that the data scientists are the highest-paid employees in a company.data science course in kochi
ReplyDelete
Replies
data science course in gorakhpurMay 22, 2022 at 5:43 PM
The first and foremost thing when learning data science is the discovery of data insight. In this aspect, the raw data is analyzed to gather information from raw data.
data science course in gorakhpur</a
ReplyDelete
Replies
Career Academic instituteJune 7, 2022 at 7:24 PM

Develop technical skills and become an expert in analyzing large sets of data by enrolling for the Best Data Science course in Bangalore. Gain in-depth knowledge in Data Visualization, Statistics, and Predictive Analytics along with the two famous programming languages and Python. Learn to derive valuable insights from data using skills of Data Mining, Statistics, Machine Learning, Network Analysis, etc, and apply the skills you will learn in your final Capstone project to get recognized by potential employers.

Data Science in Bangalore

ReplyDelete
Replies
Professional Career TechnologyJune 8, 2022 at 12:46 PM
Data Science helps in buyer retention by figuring out the triggers and churns of a business.

Data Analytics Course in Calicut
ReplyDelete
Replies
traininginstituteSeptember 21, 2022 at 11:50 AM
I think this is a really good article. You make this information interesting and engaging. You give readers a lot to think about and I appreciate that kind of writing.
data science training
ReplyDelete
Replies
ashishAugust 22, 2023 at 9:29 AM
In the heart of Gurgaon's technological landscape, APTRON's Data Science Institute in Gurgaon stands as a hub of excellence. Its comprehensive curriculum, expert faculty, practical approach, top-notch infrastructure, placement assistance, and networking opportunities make it a standout choice for individuals aspiring to excel in the field of data science. By choosing APTRON, you're not just enrolling in an institute – you're embarking on a transformative journey toward becoming a proficient data scientist ready to conquer the data-driven world.
ReplyDelete
Replies
ashishSeptember 8, 2023 at 9:26 AM
Are you ready to embark on a rewarding journey into the world of data science? Look no further than APTRON, the foremost Data Science Training Institute in Gurgaon . In today's data-driven world, mastering the art of data science is a surefire way to boost your career prospects, and APTRON is here to guide you every step of the way.
ReplyDelete
Replies

Add comment