How does Scalability affect Collaborative Filtering?

More Folasade O. Isinkaye's questions See All

How to import dataset from local-storage/google api into google colab or jupyter notebook?

I want to import an image dataset into google colab or jupyter notebook for me to train it using tensorflow and keras (ml). I am having difficulty in importing the dataset into the colab and...

06 October 2019 8,107 1 View

Full meaning of NDMP metric in recommender systems?

What is the full meaning of this accuracy metric used for measuring estimated ranking in recommender systems?

05 February 2018 4,828 3 View

How to Solve Regularized Optimization problem?

Can anyone be of help to me on how to obtain the mathematical equation for the Regularized Optimization problem in the PDF file below using Stochastic gradient Descent with bootstrap sampling for...

01 January 2018 6,478 2 View

Expected Popularity Complement (EPC) metric?

Please, I want explanations on the use of of this novelty metric for recommender system proposed by S.Vargas (2011) - Expected Popularity Complement (EPC). materials that could help will be...

26 December 2017 9,679 1 View

Simplified materials on the application of BPR SLIM in recommender system

Please, I need simplified materials that can give me thorough understanding of the following models (Bayesian Probabilistic Ranking and Sparse Linear Methods), as applied to collaborative...

18 June 2017 9,276 2 View

Assigning cluster vectors to cluster centers

Can anybody explain how to assign cluster vectors to appropriate cluster centers after performing Kmeans clustering? I am working with R/Rstudio, is there any library that does it? or can anybody...

12 June 2017 6,726 3 View

I need an explanation on RSVD and SVD Results.

I need an explanation on RSVD and SVD Results. I run an experiment in Rstudio using the same datasets on RSVD and SVD functions. I got the same result. What can be the explanation for this? Thks

01 June 2017 2,515 1 View

How good is Azure machine learning studio?

Compare with other analytic/Recommender tools/library like WEKA, R-studio, GitHub, Surprise, MyMediaLite, LensKit, LibRec etc. How good is Azure machine learning studio?

23 May 2017 1,047 3 View

How is BPR SLIM used with Collaborative Filtering to generate recommendations?

1. What are the advantages of using BPR SLIM with recommender system? 2. Could it be combined with Dimensionality reduction and Clustering to generate recommendations? 3. What are the steps to...

19 May 2017 1,450 0 View

Apart from F-1 measure, which other metrics can one use to evaluate binary rating data?

In evaluating recommender system with binary rating data, which other evaluation metrics can one use aside F-1 (Precision and Recall) measure for better accuracy?

09 May 2017 1,727 3 View

I need the datasets of Microgrid for system identification?

Hi I am working on data driven model of the microgrid, for that, i need the reliable datasets for the identification of MG data driven Model. Thanks

02 August 2024 5,748 4 View

Is the black sediment consider as sand? If not, how can I filter it out?

Hi researchers! I'm working on soil texture analysis, and the end result for sand is doubtful because there is black sediment appearing after drying, as shown in the figure. Is it considered sand?...

30 July 2024 557 2 View

Which file formats are accepted for supplementary material?

I have a dataset consisting of json files. i tried to upload a zip or tar of it but the system tells me that the file format is not accepted... br

25 July 2024 1,316 3 View

Dataset of synchronized cardiac angiography and ECG?

Hello, I'm working on medical project and I would need synchronized angiography with ECG? Does anyone know if some open source dataset of this kind exist? Regards, Bruno

25 July 2024 2,214 2 View

Can we isolate microplastic from sludge sample without using vacuum filter..?

isolation of microplastic from sludge sample using centrifugation ..

23 July 2024 6,418 0 View

How to Select the most suitable machine learning algorithm depending on the characteristics of the given dataset ?

I'm working on a project that involves analyzing a new dataset, and I'm at the stage of selecting the most appropriate machine learning algorithm. The dataset consists of both numerical and...

22 July 2024 6,097 7 View

How to use evolutionary algorithms with real parameters in ryu sdn controller with large scale?

Hi, I wanna to implement evolutionary algorithms in ryu sdn controller in mininet, i have some challenges, how i can run the big scale topo with one sdn contoller??? and another question is to...

21 July 2024 246 2 View

How to use NCBI datasets ?

I have been trying to extract genome from NCBI using their dataset tool, however some examples seem not to work : ./datasets download genome taxon "Homo Sapiens" --annotated --assembly-level...

20 July 2024 1,339 2 View

How do I access .vcf files without an R statistical package?

I am currently working on a mendelian randomization study, and I have downloaded the datasets needed from the ieu opengwas project (mrcieu.ac.uk) in .vcf format. I do not have access to an R...

19 July 2024 2,342 5 View

Which is the best approach for anomaly detection in scanned image data set?

Anomaly detection in scanned image data set

18 July 2024 3,578 3 View

Qamar Ul Islam

Dear Folasade O. Isinkaye

In general, the whole ratings database is searched in collaborative filtering and thus it suffers from poor scalability when more and more users and items are added into the database.

Kind Regards

Sumaia Mohammed Al-Ghuribi

Collaborative filtering relies on calculating the similarities among users or items to find the appropriate neighbors and the number of users and items in a system grows rapidly. For example, the behavior of such a user per day may result in his stored data reaching the size of TBs in some popular websites. Furthermore, the RS should respond in less than a second to keep users satisfied and to enable them to continuously engaged with the RS . As a result, both large-scale datasets and responding time create a challenge in designing efficient RS and as a result, it demands colossal computing resources. please refer to:

Article Multi-Criteria Review-Based Recommender System – The State of the Art

Mukul Lokhande

Since a collaborative filtering algorithm is mainly based on similarity measures computed over the co-rated set of items, the large levels of sparsity can lead to less accuracy and can challenge the predictions or recommendations of the collaborative filtering (CF)systems. Lets assume example Commercial recommender systems in general are used to evaluate very large product sets. In a user – item rating database, though users are very active, there are a few rating of the total number of items available. The user-item matrix is thus extremely sparse. Further, a CF algorithm is assumed to be efficient if it is able to filter items that are interesting to users. But, they require computations that are very expensive and grow non-linearly with the number of users and items in a database. In general, the whole ratings database is searched in collaborative filtering and thus it suffers from poor scalability when more and more users and items are added into the database. Instigated by these challenges, two collaborative filtering algorithms, firstly an algorithm based on weighted slope one scheme and item clustering & secondly an algorithm based on item classification & item clustering were studied, which dealt with the sparsity and scalability issues simultaneously.

Folasade O. Isinkaye

Thank you all for your valuable answers.