Find Jobs
Hire Freelancers

Credit risk modelling in r Programming

$30-250 USD

Cancelado
Publicado hace casi 7 años

$30-250 USD

Pagado a la entrega
You now have walked through the entire process of building a model from raw data. This is a skillset which sets you apart from other job applicants. The final project puts these skills all together. You are free to use code etc that others have shared in the past however the final project should be a work of yours not a group. 1. Use the data in the contents tab labeled Prosper.csv. Create a good ‘risk’ model. The main difference with the midterm will be twofold: a. For the model building use SAS eMiner. b. Greater thoroughness in the descriptive statistics, the writeup/documentation, the formatting of the report, and in how you came to your conclusions. 2. Descriptive stats. For each variable in the dataset summarize it (max,min,mean,etc) and make a histogram. Label and format appropriately. Do the same for any variables you create. Comment on anomalies in your data and what you did to address them. Defend your reasoning. It might be easiest to just make one page per variable. You should be very thorough in this area. There is code and some of the midterm papers and presentations posted in d2l for you to learn from. Whether you use R or SAS for the descriptive statistics is unimportant however it would seem to me that R would be best given you have worked in that environment more at this point. 3. Create a logistic regression model. In your attempt you should justify why you used the variables you did and for each rejected variable explain why you rejected it. Many groups on the midterm did a little hand waving here which is fine but now that you have gone through it you need to be more thorough and specify each variable. The variables you use in the final model should be binned and transformed to WOE. This is a change from the midterm. On the midterm there were several variables that had ‘goofy’ values. There were 999’s. There were odd values of outliers. There were categorical variables that were too many categories. A WOE transformation can ‘fix’ all of this. You can do the WOE transformation in R if you like or in eMiner the ‘interactive grouping’ node is specially made just for this task. If you use the interactive grouping node DO NOT simply accept the base grouping SAS throws at you. Go in and adjust the bins/groups in a logical fashion. Use the WOE_variables as the inputs to the model rather than the original variables. 4. For modeling you should sample the data into two groups. Typically I use 60/40 but you can use 70/30 or 50/50 if you like. 5. In addition to the above logistic regression you should attempt a more advanced model in eMiner and report its results in comparision. This might be a form of Neural Net, LARS, Dmine regression, etc. There are a number of them in eMiner. This is to show how different models can be applied in a straightforward manner once the problem has been set up correctly. For the ‘other’ model it is preferable you not use the WOE variables since those are special purpose things for logistic regression. It isn’t invalid to use WOE but it is more interesting to compare without the WOE. 6. Submit your report and any code you made. Neatness, organization, style, etc will be part of the consideration. The two nodes that can help you here are the ‘model comparison’ node which makes nice output and ROC curves and tables and such. The second node is the ‘reporter’ node which outputs a complete log of the process. In the reporter node when you are selecting the parameters you can make your life easier by telling it to create ‘rtf’ which is a ‘rich text file’ which opens in Word. Additionally you can alter the quality/type of output as shown in class. 7. We will ‘present’ during the final exam period. I will randomly draw names to present. You should prepare and submit a presentation that is not longer than 15 minutes. In other words we’ll have enough time to do several.
ID del proyecto: 13827745

Información sobre el proyecto

Proyecto remoto
Activo hace 7 años

¿Buscas ganar dinero?

Beneficios de presentar ofertas en Freelancer

Fija tu plazo y presupuesto
Cobra por tu trabajo
Describe tu propuesta
Es gratis registrarse y presentar ofertas en los trabajos

Sobre este cliente

Bandera de UNITED STATES
Brookings, United States
0,0
0
Miembro desde abr 24, 2017

Verificación del cliente

Otros trabajos de este cliente

credit risk analysis
$30-250 USD
¡Gracias! Te hemos enviado un enlace para reclamar tu crédito gratuito.
Algo salió mal al enviar tu correo electrónico. Por favor, intenta de nuevo.
Usuarios registrados Total de empleos publicados
Freelancer ® is a registered Trademark of Freelancer Technology Pty Limited (ACN 142 189 759)
Copyright © 2024 Freelancer Technology Pty Limited (ACN 142 189 759)
Cargando visualización previa
Permiso concedido para Geolocalización.
Tu sesión de acceso ha expirado y has sido desconectado. Por favor, inica sesión nuevamente.