21 Questions 30 Answers 0 Followers
Questions related from Shivi Bhatia
Normality is very essential for Linear models specially for the independent variables. However during one of the projects i had one of the stakeholders ask if for binary classifiers or...
04 April 2018 5,130 1 View
HI All, I am trying to learn more about Bootstrap Sampling. This is one of the technique used in Random Forest algorithm for example and many other places in statistics. What and why would this...
11 November 2017 6,313 0 View
Hi All, Need some advice and suggestion on one of the use case i have for one of my existing clientele. We have IT ticket queue where we get tickets from across the globe, to IT helpdesk. They...
09 September 2017 2,156 3 View
HI All, I am working on a C-SAT model. The outcome variable is 0 and 1 i.e. either Dis-Sat or C-Sat. There are more than 100 predictor variables i have from business. What are the best ways to...
09 September 2017 7,375 3 View
HI All, I have a multi level dependent variable. The dependent variable is Tenure - less than 1 year, 1-2 year and above 2 years. The independent variable is AHT. I need to conduct a...
07 July 2017 4,776 0 View
I am building a classification model using Random Forrest, Decision Trees & Logistic regression. For one of my critical variable i have some missing values. In some earlier models during...
05 May 2017 7,895 4 View
Hi All , I am trying to read few csv files for some text mining assignment. I have used this command to check the # of files in the working directory: length(dir(path="D:/Shivi/R Project",...
03 March 2017 3,564 3 View
Hi All, One of my known source is predicting house prices given multiple independent variables. In this case i would have used the old war house linear regression. However he is inclined towards...
09 September 2016 4,760 6 View
Hi Team, I know this question has been asked zillion times but even after consulting Stack Overflow & other forum cant figure out the reason. I have one var in my data-set names case age....
08 August 2016 5,070 4 View
Dear All, As stated in the heading: I need to understand when any of two model fit stats:Hosmer-Lemeshow Goodness of Fit & Kolmogorov-Smirnov Test should be used. As per the...
08 August 2016 1,349 5 View
Dear Team, I am going through a series of videos where WOE/IV (weight of evidence & information value) has been used to highlight outliers and missing values. These techniques are useful for...
08 August 2016 9,308 4 View
Dear Team, I need to replicate the below Pearson test in R as i have done in SAS. Basically this is a survey data where there are 4 rating and then i have a final rating which is the major one...
06 June 2016 8,057 5 View
Hi Team, This scenario may have come across a number of times however i checked nabble & SO and couldn't find a solution hence request assistance. I have a date variable in my data-set eir....
01 January 1970 1,960 6 View
I am working on a sales data for one of our customers and performing some exploratory analysis.We have almost 100 million rows of data. As this is a sales data/ purchase data hence the gender is a...
01 January 1970 2,397 6 View
HI Team, I am running a logistic regression model for one of my business scenario. The outcome is 0 (bad) and 1(good). I have divided the data into 2 parts:(training and test) splits
01 January 1970 5,480 4 View
Hi All, I am working on a twitter analysis using the TM package. Below are some codes: 1- Here i am creating a data frame of the data collected from...
01 January 1970 4,492 1 View
Dear Team, After using weight of evidence & Information value mechanism, of the 40 odd variables i am left with 8 variables which are highly or moderately significant. One of the independent...
01 January 1970 7,133 4 View
Hi Team, I am creating my first Logistic regression on R Studio. I am working on a C-SAT data where rating (score) 0-8 is a dis-sat whereas 9-10 are SAT. As these were in numeric form so i had as...
01 January 1970 6,795 11 View
Hi Team, Good Day. I am working on a text mining assignment and have built the document matrix using the tm package. Now I need to run findAssocs from my dtm with some word say 'like' with a...
01 January 1970 7,883 6 View
Dear All, I need to work on Speech Analytics for a client requirement. This is big data leading to approx 500 GB of data. We have calls that needs to be converted into text and then with the help...
01 January 1970 1,066 3 View
Dear Team, I need some insight on Fisher Scoring. I am building a logistic regression model and see Fisher Scoring. It says :-" Number of Fisher Scoring iterations: 6". This as i researched is...
01 January 1970 2,056 7 View